key: cord-0025460-k1maxgo5 authors: Sbaoui, Yousra; Nouadi, Badreddine; Ezaouine, Abdelkarim; Rida Salam, Mohamed; Elmessal, Mariame; Bennis, Faiza; Chegdani, Fatima title: Functional Prediction of Biological Profile During Eutrophication in Marine Environment date: 2022-01-05 journal: Bioinform Biol Insights DOI: 10.1177/11779322211063993 sha: a5d77a6167ae6ed4d6d73d2b4d9c4ee25b051b22 doc_id: 25460 cord_uid: k1maxgo5 In the marine environment, coastal nutrient pollution and algal blooms are increasing in many coral reefs and surface waters around the world, leading to higher concentrations of dissolved organic carbon (DOC), nitrogen (N), phosphate (P), and sulfur (S) compounds. The adaptation of the marine microbiota to this stress involves evolutionary processes through mutations that can provide selective phenotypes. The aim of this in silico analysis is to elucidate the potential candidate hub proteins, biological processes, and key metabolic pathways involved in the pathogenicity of bacterioplankton during excess of nutrients. The analysis was carried out on the model organism Escherichia coli K-12, by adopting an analysis pipeline consisting of a set of packages from the Cystoscape platform. The results obtained show that the metabolism of carbon and sugars generally are the 2 driving mechanisms for the expression of virulence factors. In recent decades, the emergence of molecular methods, especially the omics approach, has facilitated the study of microbial communities to understand their activities, compositions, interactions between taxa, and the use of nutrients. 1 Transcriptomics has often been coupled with other methods to understand the response of microbes to ecological interactions, nutrient acquisition, membrane transport, and growth, generating a large number of results that require strong tools to derive useful information. 2, 3 The marine coastal areas are increasingly subjected to anthropogenic and natural pollutants that affect the growth of macroorganisms and microorganisms. 4 Bacterioplankton has been linked to several types of pollution including wastewater, 5,6 chemicals, 7 organic or biological products, and waste. 8 During nutrient pollution (NP) caused by excess of nutrients specifically in coastal areas, the biota is negatively affected by algal blooms, increased growth of macroalgae, increased sedimentation and oxygen consumption, oxygen depletion in lower water layers and, sometimes, mortality of benthic animals and fish. 9 Through these negative effects, the bacterioplankton also undergoes several types of stress that act directly and indirectly on the functioning of the ecosystem and the microbiota. 10 This stress is caused by the higher concentrations of dissolved organic carbon (DOC), Nitrogen (N), Phosphate (P), and Sulfur (S) compounds, 11, 12 to which the adaptation of bacterioplankton depends on the community structure, the physiology of the organisms, the variety of environmental conditions, and their interactions. 13, 14 To survive changing environments, bacteria have evolved exquisite systems that not only sense stress but also trigger appropriate responses. 15 Their responses are related to an adaptation that involves a known resistance process especially in pathogenic bacteria such as the case of Listeria monocytogenes and a direction also of the expression of virulence genes at the appropriate time and place. 16, 17 An appreciation of stress responses and their regulation is therefore essential to understand bacterial pathogenesis. Among the modules of understanding used is the analysis of changes at the molecular and cellular level regulated by highly complex signaling pathways. 18 The whole is modulated in the form of protein-protein interaction (PPI) networks and other resulting networks because the phenomenon of protection against stress strongly suggests the presence of central proteins that control the various responses to stress. 19 The study of PPI networks requires several open source or integrated software packages that allow the integration of biomolecular interaction networks with high-throughput expression data and other molecular states in a unified conceptual framework. 20 Cytoscape is a powerful platform in this field, with its various plugins and its conjunction with large databases, it allows the extraction of central processes, central metabolic pathways (MPs), and hubs proteins during a particular stress in humans and model organisms. [21] [22] [23] The investigation of interactomes in model organisms such as Arabidopsis thaliana (L.), 24 Saccharomyces cerevisiae (Meyen), and Escherichia coli K-12 25 has been involved in predicting and improving the understanding of cellular processes and biological interactions in other organisms. 26, 27 Furthermore, the power of Cytoscape plugins in the analysis of microbiota has been documented in several works and in different microbiomes including intestinal, 28 oral, 29 vaginal, 30 and marine. 31 The study of the behavior of bacterioplankton during nutrient excess as one of the environmental parameters that affect its capacity of pathogenesis is not well documented and has never been analyzed in silico. In this work, we want to study this capacity during eutrophication and algal blooming in the model organism Escherichia coli K12, through the analysis of a profile of differentially overexpressed genes (DEGs) collected from several bibliographic sources to predict hubs proteins, biological processes (BPs), and MPs involved in the selection of copiotrophic species and the virulence of bacterioplankton. and adjusted P value <.05 has been collected from scientific publications (Table 1 ). And to avoid disambiguation, their ID has been verified in the UniProt database (https://www.uniprot.org/) and EcoCyc (the Encyclopedia of E coli K-12 genes and metabolism). The predictive analysis has been performed using the bacterium strain E coli K12 as model organism for the aquatic bacterioplankton; E coli K12 serves as the best characterized and good leader model organism for bacterial genetics and molecular biology studies. To start the analysis, the DEGs' profile was annotated in multiple Cytoscape packages (Version: 3.8.2 https://cytoscape. org/) following the pipeline ( Figure 1 ). The raw list (196 DEGs) was queried in StringApp to obtain the PPI networks. The tab-delimited ranking list txt file generated from String was analyzed to generate a subnetwork with the hub proteins reflected by the network analyzer plugin to show a topological mapping. The subnetwork was analyzed by the ClueGO to identify BPs and MPs related to excess of nutrients. String analysis. The input list of 196 DEGs was analyzed by StringApp (Version: 11.0 https://string-db.org/) for a fixed search parameter with a confidence score cutoff to 0.4 without additional interactors. The resulting networks were customized by the layout and visual style in the control panel. Subselection and topological mapping of hub proteins analysis. Three networks obtained by StringApp were subselected based on degree and filtered to obtain the hub proteins. The highlighted hub proteins and their first neighbors obtained were filtered to select the most significant terms. The results were mapped by Network analyzer plugin "http://apps. cytoscape.org/media/networkanalyzer." ClueGO analysis. The subnetwork resulting from the subselection has been analyzed by ClueGO (Version: 1.5 http://www.ici. upmc.fr/) to select representative GO processes and pathways and visualizing them in functionally organized networks. Statistical analysis of ClueGO enrichment was defined using a hypergeometric test with P ⩽ .05, corrected by the Benjamin-Hochberg method, and kappa scores ⩾0.4 as primary endpoint. The list of 196 collected proteins ( Table 1 ) was imported and analyzed by the StringApp. This latter has mapped and annotated all the genes right away. The results were performed in the format of a network with different evidence indexes (Figure 2 ), and the PPI networks obtained have identified 7 associated networks with a total of 165 out of 196 nodes, 442 edges, and a P value <10 −16 . The 165 annotated proteins in the principal network are linked either directly or indirectly through one or more interacting proteins, which enhances the existence of functional links between them. These results suggest that the proteins are at least partially biologically connected as a group, maybe participate together in the same process and have the same phenotype, which has given great importance to co-expression and high weight to genetic and protein interactions. The obtained PPI network was accompanied by a global functional enrichment analysis where BP, hub proteins, and MPs were exported. The results of the most 5 representative terms are shown in Table 2 , where GO terms are generation of precursor metabolites and energy (GO.0006091), monocarboxylic process (GO:0032787), nicotinamide and metabolic process (GO:0046496), antibiotic metabolic process (GO:0016999), and small molecule biosynthetic process (GO:0044283), and the most significant MP are carbon metabolism (eco01200), pyruvate metabolism (eco00620), glycolysis/gluconeogenesis (eco00010), pentose phosphate pathway (eco00030), and methane metabolism (eco00680). These BPs and MPs involve biochemical reactions and pathways that ultimately lead to the formation of precursor metabolites and substances from which energy is derived. [39] [40] [41] [42] [43] [44] This energy production is essential for the regulation of nutrient content during stress, to persist long enough, continue its cycle, and invade a new host. 45 Simultaneously, the 10 genes chosen as hub proteins (Table 3) based on their combined score and their connectivity in Figure 2 , which shows a co-expression profile, neighborhood, and appearance links between them and between (eno, ftsH, ravA, codA, hemN/yggW, puuD, codA, mngB, norV, can) that encoded for virulence factors such as ferrochatalases, metalloenzymes, enolases, hydrolases, and cytotoxic chemotherapeutic agents. These factors are often linked to MPs for nutrients and toxins such as lipopolysaccharides, proteases (zinc metalloproteases), and virulence factors induced by sugar metabolism in bacteria. 46,47 3 Table 1 . List of genes differentially overexpressed during nutrient excess (log FC > 1), collected from several bibliographical sources. [33] [34] [35] [36] [37] [38] . UNIpROT ACCESSION ID pROTEIN NAME Kennelly and Potts (1996) have stated that during stress conditions, microorganisms develop signal transduction systems from the outside to the inside of the cell. 48 These signals include degradative enzymes such as proteases, lipases, and substrate capture enzymes such as glutamine synthetase and alkaline phosphatase to detect environmental stresses and to control the coordinated expression of genes involved in cellular defense mechanisms. [49] [50] [51] Their response to these signals will enable their survival; enhance their resistance to a number of environmental stresses such as low pH, heat, and oxidative stress; 52, 53 and/or enhance their virulence. This is relatively true because Gram-positive bacteria especially Actinobacteria and Firmicutes present a diverse collection of regulatory proteins (CcpA, CodY, and Rex) of central metabolic capacities and virulence, which have been shaped by reductive evolution. 45, 54, 55 Among these Gram-positive bacteria is Staphylococcus aureus (S aureus), a strain indigenous to aquatic environments and thus transferred by discharges. In the presence of excess carbon, the regulatory protein CcpA stimulates transcription of ilvBoperon, making CodY more active as a repressor of many pathways that remove intermediates from glycolysis and gluconeogenesis to be fully pathogenic. 45 And in Gram-negative bacteria, regulation is stimulated by FNR which is influenced by the histone-like protein H-NS; nevertheless, FNR has been shown to be important for virulence and survival of Salmonella. 15, 56 In the light of the above discussed results, we suggest that the metabolic behavior and central BPs are highly correlated Sbaoui et al 9 with nutrient metabolism, contributing toward the progression of complications that can affect cell behavior and bacterioplankton phenotype, because as it has been mentioned, the growth of microorganisms in a non-optimal environment suggests evolutionary adaptations through specific mutations responsible for a physical form. 57 In addition, the involvement of hub proteins related to carbohydrate metabolism, proteins, nucleic acids, and membrane transport have been reported in the selection of copiotrophic and pathogenic species, 34,58 but these results require further studies because the existing research to date has not thoroughly evaluated the 4 nutrients (C, N, P, and S) together. The network generated by string software was imported as a pre-existing unformatted array in Cytoscape software. The network analyzer plugin function was used for providing network filtration and customization. The principal subnetwork obtained (Figure 3 ) provides 72/165 nodes with a confidence score of 0.8 and a PPI enrichment P value <10 −16 . The list of 72 genes was filtered and 10 hub proteins were subselected ( Table 3) . All of these genes exhibit the highest interactions between them to regulate some cellular functions. Indeed, several studies have demonstrated the key role of these enzymes in microbial metabolism such as glycolysis/gluconeogenesis, 59 pyruvate metabolism, 60 secondary metabolite biosynthesis, carbon metabolism, 61 and other fundamental intracellular processes. These results would be linked to the virulence of bacteria in the presence of an excess of nutrient. 59 According to this work, other studies have suggested that these enzymes are Figure 2 . predicted protein-protein interaction networks. parameters: Score (0.4), no additional nodes; interaction sources used: experimentation, databases, co-expression, co-occurrence, gene fusion, and neighborhood. In the interaction networks, separate lines of different colors are used to show the type of evidence that supports each interaction. considered moonlight proteins and are involved in microbial virulence. 46, 47, 62, 63 ClueGO results ClueGOapp was launched by an ontological and metabolic analyses to evaluate over-represented GO terms and MP by annotating subselected proteins and their first neighbors in biological terms hierarchically (parent-child relation) and to assign them to functional MP pathways. The results are presented as a pie chart (Figure 4 ) for BP and a functionally grouped network ( Figure 5) for MP, and 80 terms were associated with the 72 proteins. The major representative terms for GO processes are the metabolic process of small molecules, the catabolic process of organic substances, the metabolic process of carbohydrates, the metabolic process of alpha-amino acids, and the positive regulation of biological process; the major representative terms for MP are glycolysis/glycogenesis, pyruvate metabolism, the 2-component system, purine metabolism, and oxidative phosphorylation for MPs. The ClueGO results are consistent with those provided by StringApp, which also involve biochemical reactions and pathways that ultimately lead to the formation of precursor metabolites and substances from which energy is derived and most of them refers to the MPs of the purine and citrate cycle (tricarboxylic acid [TCA] cycle). The metabolic process of purine seems to be a widespread phenomenon. 64 It has been found to be a key modulator in virulence of pathogens. 65 The TCA cycle, also known as the citric acid cycle or Krebs cycle, produces energy by the complete oxidation of acetate, derived from carbohydrates, fats and proteins, to carbon dioxide. 66 In Table 2 , 51 out of 165 proteins were assigned to carbon metabolism, which suggests it as the central metabolic process and the main nutrient during eutrophication. Deutscher et al and Görke and Stülke reported the binding of carbon catabolism to microbial virulence. 67, 68 Excessive carbon sources and DOC were documented as enhancers of bacterial growth, oxygen removal, and selector for copiotrophs and opportunistic pathogens in both seawater and coral holobiota 69, 70 using their preferred carbon substrate through ATP-binding cassette Sbaoui et al 13 (ABC) transporters. 71, 72 The ABC transporters were reported in studies involving genes related to virulence and symbiotic interactions 73 and highly reported in copiotrophs to the opposites of oligotrophs. 74 Haas et al 75 reported the abundance of Gammaproteobacteria and Alphaproteobacteria in enriched and algal-dominated waters in contrast to coral-dominated oligotrophic waters, and this suggests the possible adaptation of the studied bacterioplankton in case of existence in such an environment, but all this needs further study and discussion to draw strong conclusions. In Figure 5 , many proteins are multitasking and provide at least 2 MPs, which reminds us of moonlighting proteins. The existence of moonlighting proteins in microorganisms is a known, but still poorly understood phenomenon. 76 Most of these proteins exercise their role in the cytoplasm and outside the cell. Their existence has been linked to virulence and they are often domestic enzymes, especially those of the glycolytic pathway, such as enolase, aldolase, dehydrogenase, heat shock proteins, and transcription factors, and they may perform noncatalytic roles with different functions depending on their cellular localization and the concentration of substrates. 62 In the analyzed differential gene expression (DGE) profile, pyruvate metabolism, 60 carbon metabolism, 61 and glycolysis/gluconeogenesis 59 ( Figure 5 ) are central glycolytic MP that involved moonlight proteins and are related to virulence in bacteria. Taken together, the analyses of BP and MP (Figures 4 and 5) reveal that the interconnected proteins during the nutrient excess and the bloom proliferation phase in the model organism E Coli K12 are involved in chemical reactions and cellular metabolism involving carbohydrates and organic acids. Thus, several studies have reported the relationship between moonlight proteins, carbon catabolism, and microbial virulence factors. 67, 68 In addition, the involvement of hub proteins related to carbohydrate metabolism, proteins, nucleic acids, and membrane transport has been reported in the selection of copiotrophic and pathogenic species. 34, 58 Conclusions Transcriptomic data are increasingly numerous and varied, facilitating data mining at a system level. A large number of approaches/tools have been developed to detect pathways and processes that are significantly altered between different experimental conditions during stress by pollutants or other substances. The objective of this work is to study the capacity of bacterioplankton during eutrophication and algal blooms in the model organism E coli K12, through the analysis of a profile of DEGs collected from several bibliographic sources to predict hub proteins, BP and MP involved in copiotrophic species selection, and bacterioplankton virulence. The obtained results suggested that the metabolic behavior and central BPs are strongly correlated with carbon and carbohydrate metabolism, contributing to the progression of complications that can affect the cellular behavior and phenotype of bacterioplankton. The involvement of hub proteins related to carbohydrate, protein, nucleic acid metabolism, and membrane transport has been reported in the selection of copiotrophic and pathogenic species during excess of nutrients, but these findings require further study. The bacterial stress adaptation of E coli to excess nutrients and the possibility of increased virulence associated with stress need to be studied in more detail to prevent potential risks of host-microbiota interactions. This is important because understanding the mechanisms and regulation of bacterioplankton stress adaptation will provide information for pathogen control and enhance the effective design of new control methods. Furthermore, the identification of moonlight proteins is clearly not an easy process as most of the currently identified bacterial moonlight proteins were discovered by chance. Today, researchers are using antimicrobial susceptibility testing to address the problem of multidrug resistance by Gram-positive and Gram-negative commensal and pathogenic bacteria. But questions arise as to their use in the treatment of pathogenesis in aquatic habitats. In aquatic environments, the use of such strategy has often been associated with aquaculture. Moreover, with the mechanisms of microbial evolution, their adaptations, the poor practices of treatment, and discharge of microbes in some laboratories in developing countries and the discharge of wastewater into aquatic environments, such a process suggests the development and diffusion of resistance genes to biomolecules (phenolic compounds) through horizontal and vertical transfers while creating a new problem to be solved but in the long term. Substrate-controlled succession of marine bacterioplankton populations induced by a phytoplankton bloom Genome-wide transcriptional response of the archaeon Thermococcus gammatolerans to Cadmium Exploring mutualistic interactions between microalgae and bacteria in the omics age A latitudinal diversity gradient in planktonic marine bacteria Pilot-scale constructed wetlands for swine wastewater treatment: microbial community analysis in bacterioplankton and epiphyton and options for resource recovery SARS-CoV-2 as enteric virus in wastewater: which risk on the environment and human behavior Impacts of chemical contamination on bacterio-phytoplankton coupling The globalization of cultural eutrophication in the coastal ocean: causes and consequences Microbes and environment Marine eutrophication Eutrophication: sources and drivers of nutrient pollution Ecological consequences of bacterioplankton lifestyles: changes in concepts are needed Sensitivity of bacterioplankton to environmental disturbance: a review of Baltic sea field studies and experiments Bacterial stress responses during host infection The impact of environmental stress on Listeria monocytogenes virulence The role of stress and stress adaptations in determining the fate of the bacterial pathogen Listeria monocytogenes in the food chain Cell signaling pathways as control modules: complexity for simplicity? Protein-protein interaction networks (PPI) and complex diseases Protein-protein interactions essentials: key concepts to building and analyzing interactome networks Cytoscape: a software environment for integrated models of biomolecular interaction networks A travel guide to Cytoscape plugins Cytoscape 2.8: new features for data integration and network visualization Arabidopsis Interactome Mapping Consortium. Evidence for network evolution in an Arabidopsis interactome map A network biology approach to decipher stress response in bacteria using Escherichia coli as a model Interactome modeling Integrative analysis of the genes induced by the intestine microbiota of infant born to term and breastfed Oral microbiome alterations associated with early childhood caries highlight the importance of carbohydrate metabolic activities Complex species and strain ecology of the vaginal microbiome from pregnancy to postpartum and association with preterm birth Changes in the structure of the microbial community associated with Nannochloropsis salina following treatments with antibiotics and bioactive compounds Co-occurrence analysis of microbial taxa in the Atlantic ocean reveals high connectivity in the free-living bacterioplankton Metaproteomics reveals differential modes of metabolic coupling among ubiquitous oxygen minimum zone microbes Excess labile carbon promotes the expression of virulence factors in coral reef bacterioplankton Perceiving molecular evolution processes in Escherichia coli by comprehensive metabolite and gene expression profiling Development of an environmental functional gene microarray for soil microbial communities Transcriptomic profiling of marine bacteria between development and senescence phases of a phytoplankton bloom Comparative metagenomics study reveals pollution induced changes of microbial genes in mangrove sediments Metabolic regulation of a bacterial cell system with emphasis on Escherichia coli metabolism 1-Aminocyclopropane-1-carboxylic acid deaminase producing beneficial rhizobacteria ameliorate the biomass characters of Panicum maximum Jacq. by mitigating drought and salt stress Metabolism and biochemical properties of nicotinamide adenine dinucleotide (NAD) analogs, nicotinamide guanine dinucleotide (NGD) and nicotinamide hypoxanthine dinucleotide (NHD) NAD+ metabolism, metabolic stress, and infection Bacterial metabolism and antibiotic efficacy Nutrient stress small-molecule screening platform for Escherichia coli Regulating the intersection of metabolism and pathogenesis in Gram-positive bacteria Vibrio fischeri lipopolysaccharide induces developmental apoptosis, but not complete morphogenesis, of the Euprymna scolopes symbiotic light organ Vibrio aestuarianus zinc metalloprotease causes lethality in the Pacific oyster Crassostrea gigas and impairs the host cellular immune defenses Fancy meeting you here! A fresh look at "prokaryotic" protein phosphorylation The transient phase between growth and nongrowth of heterotrophic bacteria, with emphasis on the marine environment Genetic basis of starvation survival in nondifferentiating bacteria Life after log Starvation-induced stress resistance in Lactococcus lactis subsp Isolation and characterization of Staphylococcus aureus starvation-induced, stationary-phase mutants defective in survival or recovery Seryl-phosphorylated HPr regulates CcpA-independent carbon catabolite repression in conjunction with PTS permeases in Streptococcus mutans Nocardia species: host-parasite relationships Coordinated regulation of virulence during systemic infection of Salmonella enterica serovar Typhimurium Evolution experiments with microorganisms: the dynamics and genetic bases of adaptation The effects of harmful algal blooms on aquatic organisms Comparison of the regulation, metabolic functions, and roles in virulence of the glyceraldehyde-3-phosphate dehydrogenase homologues gapA and gapB in Staphylococcus aureus Role of the pyruvate metabolic network on carbohydrate metabolism and virulence in Streptococcus pneumoniae Correlations between carbon metabolism and virulence in bacteria Protein moonlighting: what is it, and why is it important Bacterial virulence in the moonlight: multitasking bacterial moonlighting proteins are virulence determinants in infectious disease The purinosome, a multi-protein complex involved in the de novo biosynthesis of purines in humans The link between purine metabolism and production of antibiotics in streptomyces The citric acid cycle Carbon catabolite repression in bacteria: many ways to make the most out of nutrients How phosphotransferase system-related protein phosphorylation regulates carbohydrate metabolism in bacteria. Microbiol Metagenomic analysis of stressed coral holobionts Effects of coral reef benthic primary producers on dissolved organic carbon and microbial activity Phosphoenolpyruvate:carbohydrate phosphotransferase systems of bacteria Regulation of mannose phosphotransferase system permease and virulence gene expression in Listeria monocytogenes by the EIItMan transporter Transcriptional control in marine copiotrophic and oligotrophic bacteria with streamlined genomes The genomic basis of trophic strategy in marine bacteria Global microbialization of coral reefs Dancing to another tune-adhesive moonlighting proteins in bacteria YS, BN, AE, RS, MEM, FB, and CF contributed to conceptualization; YS, BN, AE, and RS contributed to data curation; YS contributed to formal analysis; FB and CF contributed to validation; YS, MEM, FB, and CF contributed to supervision of this study; YS contributed to writing original draft, reviewing, and editing; YS, BN, AE, RS, MEM, FB, and CF contributed to visualization. Yousra Sbaoui https://orcid.org/0000-0002-6919-4929 Badreddine Nouadi https://orcid.org/0000-0001-5175-4601