key: cord-031937-qhlatg84 authors: Verma, Anukriti; Sharda, Shivani; Rathi, Bhawna; Somvanshi, Pallavi; Pandey, Bimlesh Dhar title: Elucidating potential molecular signatures through host-microbe interactions for reactive arthritis and inflammatory bowel disease using combinatorial approach date: 2020-09-15 journal: Sci Rep DOI: 10.1038/s41598-020-71674-8 sha: doc_id: 31937 cord_uid: qhlatg84 Reactive Arthritis (ReA), a rare seronegative inflammatory arthritis, lacks exquisite classification under rheumatic autoimmunity. ReA is solely established using differential clinical diagnosis of the patient cohorts, where pathogenic triggers linked to enteric and urogenital microorganisms e.g. Salmonella, Shigella, Yersinia, Campylobacter, Chlamydia have been reported. Inflammatory Bowel Disease (IBD), an idiopathic enteric disorder co-evolved and attuned to present gut microbiome dysbiosis, can be correlated to the genesis of enteropathic arthropathies like ReA. Gut microbes symbolically modulate immune system homeostasis and are elementary for varied disease patterns in autoimmune disorders. The gut-microbiota axis structured on the core host-microbe interactions execute an imperative role in discerning the etiopathogenesis of ReA and IBD. This study predicts the molecular signatures for ReA with co-evolved IBD through the enveloped host-microbe interactions and microbe-microbe ‘interspecies communication’, using synonymous gene expression data for selective microbes. We have utilized a combinatorial approach that have concomitant in-silico work-pipeline and experimental validation to corroborate the findings. In-silico analysis involving text mining, metabolic network reconstruction, simulation, filtering, host-microbe interaction, docking and molecular mimicry studies results in robust drug target/s and biomarker/s for co-evolved IBD and ReA. Cross validation of the target/s or biomarker/s was done by targeted gene expression analysis following a non-probabilistic convenience sampling. Studies were performed to substantiate the host-microbe disease network consisting of protein-marker-symptom/disease-pathway-drug associations resulting in possible identification of vital drug targets, biomarkers, pathways and inhibitors for IBD and ReA. Our study identified Na((+))/H((+)) anti-porter (NHAA) and Kynureninase (KYNU) to be robust early and essential host-microbe interacting targets for IBD co-evolved ReA. Other vital host-microbe interacting genes, proteins, pathways and drugs include Adenosine Deaminase (ADA), Superoxide Dismutase 2 (SOD2), Catalase (CAT), Angiotensin I Converting Enzyme (ACE), carbon metabolism (folate biosynthesis) and methotrexate. These can serve as potential prognostic/theranostic biomarkers and signatures that can be extrapolated to stratify ReA and related autoimmunity patient cohorts for further pilot studies. www.nature.com/scientificreports/ approach has advantages over the traditional approach for network analysis that can help to simultaneously characterize several protein interaction modules and has the potential to study complex diseases. The vital information obtained in our study from in-silico analysis is cross-validated through targeted gene expression experimental analysis on patient cohorts. This study will help us to obtain clinico-molecular informatics-based outcomes and expand our knowledge regarding the understanding of biological functions for IBD co-existent ReA. Text mining: data screening and selection. Systematic data search and organization was carried out incorporating data identification, data screening and data selection to find target microorganisms involved in Inflammatory Bowel Disease (IBD) and Reactive Arthritis (ReA). Data identification was carried out to obtain records through data sources utilising keywords (e.g. "Microorganism AND Inflammatory bowel disease AND Reactive arthritis") incorporating Boolean operators (AND/OR/NOT). Data screening and selection were carried as part of the manual curation through primary and secondary screening scrutinizing collected data records to obtain organized records relevant for the autoimmune and enteric disorders triggered by microorganisms, especially IBD and ReA and the microbial triggers implicated in IBD and ReA that were utilised for further metabolic network reconstruction. bottom-up approach consisting of draft reconstruction and manual reconstruction refinement was followed to create metabolic networks of obtained target microorganisms. Genome-scale Metabolic models Simulation, Reconstruction and Visualization (GEMSiRV) software 51 that includes reciprocal Basic Local Alignment Search Tool (BLAST) of target microorganisms against a template metabolic network of its phylogenetic neighbour and incorporates information from National Center for Biotechnology Information (NCBI), Kyoto Encyclopedia of Genes and Genomes (KEGG) and Transport DB was used for creating draft reconstructs. The manual curation of missing links or gaps in the draft reconstruct was done by mapping the incomplete information to other databases such as Expert Protein Analysis System (ExPASy) 52 and Integrated relational Enzyme database (IntEnz) 53 . This fully connected and annotated network was used for further simulation studies 54 . The metabolic networks thus obtained were visualized using CellDesigner, a tool for modelling and editing biochemical and gene-regulatory networks. Simulation analysis was carried by converting the metabolic networks obtained into a mathematical model and performing the gene deletion analysis to retrieve essential genes. Model conversion was through generation of stoichiometric based matrixes consisting of reactions (columns) and metabolites (rows) corresponding to respective genes. Upper boundary and lower boundary fluxes i.e. movement of matter across a system were generated for the gene associated reactions and metabolites that was extracted in Systems Biology Markup Language (SBML) format. The next step was gene deletion analysis done using the Constraint Based Reconstruction and Analysis toolbox (COBRA) that runs in Matrix Laboratory (MATLAB) 55 for finding the essential genes based upon the gene-reaction matrix and boolean relationship between genes and reactions 56 . The purpose of data filtering is to remove repeats and homologs from essential genes of target microorganisms associated with IBD co-existent ReA. The non-homologous protein sequences corresponding to the essential genes of target microorganisms were extracted from Pathosystems Resource Integration (PATRIC) database 57 . Refinement of protein sequences was further done using Cluster Database at High Identity with Tolerance (CD-HIT) 58 suite so as to have 60% identity non-repeat sequence tolerance stringency. BLAST-P was further used to remove the homologs from such non-repeats against human database at e-value of 10 -4 to obtain nonhomologous protein sequences used for further in-silico analysis. Essential host-microbe and microbe-microbe interactions. The host-microbe interactions of the non-homologous proteins for the selected target microorganisms were obtained using Host-pathogen Interaction Database (HPIDB) 59, 60 . The host-microbe interactions were visualised using Cytoscape. Simulation analysis (gene essentiality) was done to obtain the essential host proteins interacting with common microbe proteins of microorganisms triggering IBD and ReA utilising the human metabolic model HMR 2, a COBRA compliant metabolic model of human consisting of around 3,765 genes, 8,000 reactions and 3,000 metabolites 61 . This led to profiling of the common host-microbe and microbe-microbe interactions comprehending the complex 'interspecies communication' as complex interaction maps, executed using Search Tool for the Retrieval of Interacting Genes/proteins (STRING) 62,63 . Host-microbe disease network and molecular mimicry studies. The host-microbe disease network is a multilayered archetype that connects the protein-marker-symptom/disease-drug-pathway associations. The contributions of the microorganisms in the co-evolved IBD and ReA as part of the disease network was created through the interactive maps of the essential host interaction proteins (verified using literature survey) and the information processed through gene expression data analysis 64 . The information patronised here is mostly scored through the available non-specific protein diagnostic markers of both IBD and ReA e.g. C-Reactive Protein (CRP), Interleukin 6 (IL6) and Toll Like Receptor 4 (TLR4), Major Histocompatibility Complex, Class I, B (HLA-B) and Major Histocompatibility Complex, Class II, DR Beta 1 (HLA-DRB1) with the essential host proteins determined using STRING 65 . Database GeneCards 66 was used to assess the role of these interacting partners aka proteins further with symptoms/diseases associated with IBD and ReA. The pathways of the above host interacting proteins were found out using KEGG database that provides ontologies for proteins related to biological processes 67 www.nature.com/scientificreports/ Subsequently, the role of drugs or inhibitors used to suppress the effect of IBD and ReA such as indomethacin, prednisone, ciprofloxacin, sulfasalazine, azathioprine, methotrexate and hydroxychloroquine was scored in the disease network through their docking studies against the potential targets (both host as well microbial targets) as per published methodologies 68, 69 . The host-microbe disease network which is an amalgamation of all the above patterned associations was visualized using Cytoscape software 70 . Molecular mimicry analysis between the vital targets triggering IBD co-evolved ReA, essential human proteins including HLA-B27, HLA-B51 and HLA-DRB1 was done using data repository ExPASy. This led to retrieval of microbe relayed protein sequences that have been implicated in disease development after sequence alignment performed using EMBOSS 71 . Experimental evidences to identify the signature molecules in patient samples. The cross-validation of vital in-silico targets was done in ReA patient cohort cases via targeted gene expression analysis. Scientific and ethical clearance was taken from Amity University Ethics Committee and Institutional Ethics Committee, Fortis Noida for handling the patient samples. All experiments were performed in accordance with Indian Council of Medical Research (ICMR) guidelines constituting the ethics committees. The study was carried out for 6 months on the rare disorder ReA patients, with the inclusion criteria as patients having ReA according to European Spondyloarthropathy Study Group (ESSG) 72 and exclusion criteria as patients undergoing treatment from last 3-6 months and healthy controls (HC). The participants were inducted in the study design with an informed consent form along with a questionnaire containing information regarding symptomatic and diagnostic history of patient and linked disorders. Blood (5 mL) was drawn from participants in ethylenediaminetetraacetic acid (EDTA) vacutainers. These were transported to the laboratory for further analysis. The processing of the samples was done within 2-4 h of procurement 73 . Peripheral blood mononuclear cells (PBMC's) were isolated from blood using density gradient centrifugation 74 . RNA was isolated from PBMC's using TRIzol method 75 . The quantification of RNA was done using nano-drop 76 . The High Capacity cDNA Reverse Transcription Kit (Applied Biosystems™) was used for conversion of RNA to single-stranded cDNA as per the standard protocol 77 . Quantitative PCR analysis of target gene was executed using Biorad CFX96 Real time-PCR taking human housekeeping gene, GAPDH as a reference. Previously reported primers for qPCR analysis of target and reference gene were selected for this study 78, 79 following the standard protocol 80 . Relative gene expression analysis from qPCR data was performed using the Relative Expression Software Tool (REST® 2009) 81 that utilises the expression of reference genes to normalize expression of target genes in different samples. The schematic representation of methodology involved in our combinatorial analysis is provided in Fig. 1 . Text mining: data screening and selection. A systematic literature mining and curation for our thematic connecting autoimmune disorders, Inflammatory Bowel Disease (IBD) and Reactive Arthritis (ReA) was carried out. Data identification extracted 1,071 records (articles in journals, book chapters, conference papers etc.) corresponding to autoimmune and enteric disorders. Data screening extracted 426 records of autoimmune and enteric disorders triggered by microorganisms that belong to class of bacteria, fungi, protozoan, mites, virus, yeast and nematode. Data selection yielded 48 IBD, 32 ReA and 5 IBD co-evolved ReA records. Data selection was directed towards the microbial contenders implicated here resulting in 6 target microorganisms namely Campylobacter jejuni, Escherichia coli O157:H7, Klebsiella oxytoca, Salmonella typhimurium, Shigella dysenteriae and Yersinia enterocolitica, whose genome information was available. The etiopathogenesis in the co-evolved disorders have been documented through gut microbiome associated host-pathogen interactions studies, perpetuating where pathogen microorganisms involve in dysbiosis leading to autoimmunity. The results of text mining are provided in Fig. 2 . The list of microorganisms is provided in Supplementary Table S1 online. ing of genes along with their corresponding proteins, reactions and metabolites for the selected microorganisms serve as primary set of partial metabolic network information. The missing data persistent in the draft reconstruct obtained through Genome-scale Metabolic models Simulation, Reconstruction and Visualization (GEM-SiRV) was manually refined. Entirely associated metabolic networks of target microorganisms were obtained (genes, proteins and reactions). The essential genes of microorganisms (vital for survival. sustenance and growth) were obtained after performing simulation on mathematical models consisting of gene associated reactions and metabolites (metabolites, inner cell reactions, exchange reactions and essential genes). Due to lack of availability of exchange reactions for Campylobacter jejuni, simulation analysis on the partial metabolic network could not be carried out and essential genes could not be retrieved. An alternative approach for finding essential genes of Campylobacter jejuni was carried out. The essential genes of Campylobacter jejuni were taken from our previous published report and were found out to be 228 69 . Table 1 portrays the results of metabolic network reconstruction and simulation of target microorganisms. The metabolic network and simulation analysis data of target microorganisms is provided in Supplementary Table S2 online. The proteins corresponding to essential genes, non-repeats and non-homologs were obtained as stated below according to the parenthesis {proteins corresponding to essential genes, non-repeats, non-homologs}. The essential genes, their corresponding proteins, reactions and metabolites from the curated dataset were refined to create a list of most relevant molecular indicators to assess their coveted role in disease establishment. The non-redundant filtered proteins were utilised further in the computational work-pipeline canvassing the drug targets and signatures in the interspecies communication. Essential host-microbe and microbe-microbe interactions. The central mechanism of hostmicrobe/microbe interface conferred through gut microbiome was correlated for the selected microbial species and processed to obtain the common signatures so as to follow the core system of metabolic changes affecting the host harbouring them as either commensal or pathogenic loads. The interactors between human and target microorganisms were obtained. The interactors of Escherichia coli O157:H7 were 136; Klebsiella oxytoca were 141; Salmonella typhimurium were 136; Shigella dysenteriae were 117 and Yersinia enterocolitica were 133. There were no interactors for Campylobacter jejuni (Supplementary Table S3 -S7 online). Table 2 shows the results of filtering and host-microbe interactions of protein sequences corresponding to essential genes of target microorganisms. www.nature.com/scientificreports/ The host-microbe interactors were analysed for all the target microbial species and processed to obtain the common signatures. 43 proteins were found between all target microorganisms having interaction among themselves and with 130 human proteins. The essential host correlative targets to the microbial gene targets were followed by obtaining host essential genes and corresponding proteins from human metabolic model HMR 2. There were 1,401 essential proteins (Supplementary Table S8 online) the essential human protein was found out to be KYNU having interaction with essential microbial protein NHAA (Fig. 3) . NHAA was also having interactions with non-essential HCLS1 Associated Protein X-1 (HAX1), Prolyl endopeptidase-like (PPCEL), Biogenesis of Lysosomal Organelles Complex 3 Subunit 1 (HPS1) and Eukaryotic Translation Initiation Factor 2 Alpha Kinase 1 (E2AK1) proteins of human host. KYNU was further mapped with host proteins (direct and indirect) resulting in 1994 interactions. Out of these the single connected essential protein interactions were 988 and protein interactors were 412 ( Fig. 4 and see Supplementary Table S9 online). The research design here followed to assess the interaction map of essential proteins in human host to indicate the clinical insights in pathophysiological trends in the autoimmune development. Host-microbe disease network and molecular mimicry. The human essential proteome complement with its interacting proteins were analysed further as part of the disease network. 394 human essential protein interactors were found to be associated with IBD and similarly 3 essential protein interactors namely Adenosine Supplementary Table S10 online) . These 397 proteins can be postulated as probable contenders transcending their role in the simulated network as important regulators in the co-existent disorders. The composite associations of the above 397 proteins with non-specific protein diagnostic markers of IBD and ReA were obtained (see Supplementary Table S11 online) . This gave rise to a single connected protein network consisting of 402 proteins and 13,350 interactions. The association of above 402 with symptoms and diseases linked with IBD and ReA were obtained (see Supplementary Table S12 online) . Apart from non-specific diagnostic markers, the major protein linked with majority of symptoms/diseases is Angiotensin I Converting Enzyme (ACE). 78 pathways of the 402 proteins were obtained (see Supplementary Table S13 online) in total out of which the pathway associated with majority of proteins was carbon metabolism. Another layer of disease network substantiates the role of therapeutic regime followed in the studied autoimmune diseases, so the docking analysis of drugs used to suppress the effect of IBD and ReA against NHAA of target microorganisms and KYNU of human host was done. The docking analysis resulted in docking scores that represent binding of drugs with host KYNU and microbial NHAA of all 5 microorganisms selected in our study. Higher the negative docking score more is the binding 68 . Escherichia coli O157:H7 NHAA shows highest and lowest docking score with methotrexate (− 7.362) and azathioprine (− 3.491); Klebsiella oxytoca NHAA with methotrexate (− 5.083) and azathioprine (− 3.459); Salmonella typhimurium NHAA with ciprofloxacin (− 5.135) and hydroxychloroquine (− 2.597); Shigella dysenteriae NHAA with methotrexate (− 8.059) and azathioprine (− 3.847); Yersinia enterocolitica NHAA with hydroxychloroquine (− 7.47) and azathioprine (− 3.451) and human KYNU with hydroxychloroquine (− 5.357) and indomethacin (1.113). Our results portray methotrexate to have highest docking scores with maximum proteins and therefore can be considered as a vital drug for IBD associated ReA. The resultant docking scores are provided in Fig. 5 . The extensive interaction pattern of NHAA with KYNU along with 396 proteins, 5 markers, 66 symptoms/ diseases, 78 pathways and 7 drugs give rise to a host-microbe disease network of IBD co-existent ReA (Fig. 6 and see Supplementary Table S14 online) . The final league of information processed in this study design was to accommodate the concept of molecular mimicry between the essential host proteins and selected microorganisms. NHAA protein of target microorganisms shows homology with human HLA-B27, HLA-B51 and HLA-DRB1 (Fig. 7) . Peptides homologous to HLA-B27: Peptides homologous to HLA-DRB1: Experimental evidences to identify the signature molecules in patients. The in-silico analysis followed for the molecular signature identification till far through gene expression datasets and curated metabolic reconstructs strongly indicate the host protein, KYNU being the singular common predictive markers for all pathogenic microbes. KYNU has also been indicated in the expression data of inflammatory linked disorder, www.nature.com/scientificreports/ IBD. There is lack of data available regarding KYNU differential expression in ReA, therefore the experimental evaluation of KYNU through targeted expression analysis in ReA patients was carried out. A non-probabilistic convenience sampling was followed for our single blind study. This study encompassed 15 individuals: 60% male with mean age of 45.7 and 40% female with mean age of 38 (9 males and 6 females). Out of these cases were: 10 with ReA and controls were: 3 currently undergoing treatment, 1 with Poncet's Disease (PD) and 1 Healthy control (HC). The clinical characteristics of the patients recruited in the study included inflammatory back pain in 33%, fatigue in 60%, fever in 27%, swollen joint in 47%, Ankylosing Spondylitis (AS) that affects spine in 7%, dactylitis that is inflammation in finger or toe in 7% and Poncet's Disease (PD) in 7% of participants. The clinical characteristics of the recruits are provided in Table 3 . The expression of KYNU in Peripheral Blood Mononuclear Cells (PBMC'S) of ReA cases vs controls was evaluated using Relative Expression Software Tool (REST) software that estimated a sample's relative expression ratio in relation to the control housekeeping gene (here GAPDH) by calculating an intermediate absolute concentration value: where CP = point at which fluorescence escalates considerably above the background fluorescence. Here the CP values for reference and target genes are collectively redistributed to control and sample groups and the expression ratios are calculated based on the mean value. A Pair Wise Fixed Reallocation Randomisation Test is followed for normalisation of the target genes with a reference gene and for calculating the statistical difference of variation between 2 groups 81 . It utilises a bootstrapping technique providing a 95% confidence interval for expression ratios. It uses a P(H1) test for testing the significance between the samples and controls. According to our analysis, KYNU sample group is different to control group where P(H1) = 0.025. KYNU was found to be downregulated in sample group (in comparison to control group) by a mean factor of 0.115 (Standard error range is 0.018-0.837) as depicted in the whisker-box plot (Fig. 8) . KYNU expression showed a ~ ninefold decline in ReA cases as compared to controls. Gut microbiome is pitched to be the central theme housing enormous diversity of microbial species, characterizing the fine balance between healthy and diseased states. The physiological drifts from healthy to diseased and vice-versa is tuned to sophisticated interactive networks of human host and the microbial flora residing the gut. The autoimmune conditions Reactive Arthritis (ReA) and Inflammatory Bowel Disease (IBD) have been linked to prevalent dysbiosis of the gut, where disease development occurs as a perceptive reaction due invading population of microbes. To find out the basal networks of interactions at the host-microbe interface, common microbes affecting the co-evolved diseases with shared characteristics were studied. These involved comprehensive analysis of the bimolecular functional networks including the gene, protein, metabolite molecular signatures engraved at the host-microbe and microbe-microbe interface. This 'interspecies communication' have been linked now with immuno-pathogenesis of most human autoimmune disorders 82, 83 . www.nature.com/scientificreports/ The etiopathology of these interactions have remained elusive leading to non-specific diagnostic criteria and therapeutic regimes. It is suggested that microbial dysbiosis, pathogenic infection and host-microbe interactions cause incidence of ReA. In this study, utilising the combinatorial approach we have compiled a repertoire of microorganisms, biomolecules and pathways that are possibly involved in triggering co-evolved autoimmune disorders IBD and ReA. In our study, text mining results convey the presence of microorganisms namely Campylobacter jejuni, Escherichia coli O157:H7, Klebsiella oxytoca, Salmonella typhimurium, Shigella dysenteriae and Yersinia enterocolitica implicated in both the disorders. The thematic concepts for microbe contribution in host immunity have been explored in our previous analysis of metabolic reconstruction and simulation of Campylobacter jejuni and Salmonella enterica 69, 84 . In our current study, we used a designated work-pipeline for metabolic network reconstruction and simulation of target microorganisms. The analysis conducted extracted the information via constraint-based bottom-up approach that was filtered and utilised for further computational analysis. The essential genes, proteins and metabolites of microorganisms represent the promising drug targets as these are speculated to contribute towards infection triggered host physiological drifts leading to development of the co-evolved pattern of autoimmunity in IBD and ReA. A thorough curation pattern followed led to provide robust molecular cues in terms of essential proteins and biological networks that are correlated to the 'interspecies communication' using the host-microbe and microbemicrobe interaction profiling. The most closely associated common protein observed in all the selected common microbial species involved in both IBD and ReA is Na (+) /H (+) antiporter (NHAA), microbial integral membrane protein, catalyzing the exchange of 2 H (+) per Na (+)85 and involved in processes crucial for cell viability. Similarly, the common host interacting protein with NHAA is Kynureninase (KYNU), involved in tryptophan metabolism and whose differential expression (upregulation and downregulation based on the control samples) have been followed in IBD patient cohorts [86] [87] [88] . As per the scientific discourse presented in the studied disorders, the pathological mechanism hypothesizes that after bacterial infection, antigen-presenting cells transport bacterial antigens/peptides into the synovial membrane, where the bacterial components persist causing inflammation. It is suggested that in host-microbe interactions, bacterial proteins entering host cells interact with host proteins and inject their effector components, but has not been proven in ReA and IBD. So, this formed a basis of one of the parameters in our study design where we found the physical interactions between NHAA and KYNU and predicted that these might be the early host-microbe interactors for establishing pathogenesis in IBD associated ReA. This could assist to comprehend the very few reports indicated in the rare autoimmune ReA, where gene expression datasets of the co-evolved disorder IBD can serve to incorporate the larger theme of gut-microbiome associations. The theme of gut-microbiome paradigm shifts thus contemplates the vital cues in triggering autoimmunity with indirect linkages to diet and environmental triggers. This is indicative of the identified target molecular signature, KYNU, found to be differentially regulated in the patient cohorts with history of infection triggered or IBD co-evolved ReA. KYNU and NHAA could serve as the robust early and essential host-microbe interacting targets and molecular indicators involved in interspecies communication in IBD associated ReA. The investigations further were targeted for parallel analysis of other host-essential protein partners enmeshed to have interaction with host protein KYNU indicating the intricate details of host-microbe interaction information. The disease network constructed through our approach consists of 412 single connected essential protein interactors of KYNU, where 394 human essential protein interactors are found to be associated with IBD, while 3 of them (Adenosine Deaminase (ADA), Catalase (CAT) and Superoxide Dismutase 2 (SOD2)) are associated with both IBD and ReA. ADA protein has been reported in Juvenile Idiopathic Arthritis and ReA patient cohorts in serum samples 89 . Similarly, CAT and manganese superoxide dismutase (SOD) genes polymorphisms were observed in ReA patient cohorts 90, 91 . These become part of the host-microbe disease network where such molecular elements and co-regulatory pathways represent the intricate biological cross-talk followed during disease development. Pathological conditions can also trigger immune cells such as IL's and TLR's and various cytokines leading to immune cell infiltration in host and higher levels of inflammation. Genetic factors such as HLA alleles encode susceptibility, contribute to bacterial persistence and increase risk in ReA cases. Based on this we also found the interactions of important targets in our study with immunogenic and genetic factors. The host harboured assorted essential proteins were further probed for their association with non-specific protein diagnostic markers as well as with symptoms/diseases linked with IBD and ReA, accruing towards a single connected network consisting of 402 interdependent proteins. The reciprocation of these integrated protein indicators to the disease development is conveyed through metabolite monitoring as in the study, Angiotensin I Converting Enzyme (ACE) was found to be linked with maximum symptoms/diseases. ACE is involved in catalyzing the conversion of angiotensin I into angiotensin II that is a potent vasopressor and aldosterone-stimulating peptide that controls blood pressure and fluid-electrolyte balance 92 . This could be the indicator of involvement of microbe triggered host physiological drifts. Subsequently, the pathways associated with the proteins ramified into 78 pathways of human host speculated to give details of metabolic regulatory checkpoints where carbon metabolism is found to be associated with majority of deduced proteins. Carbon metabolism pathway implicated here as the vitally generic pathway for IBD co-related ReA confers how diet, balance of gut microbiome, antibiotic exposures can have layered impact on autoimmune disease progression and remissions. KYNU is found to be downregulated in ReA patients as compared to controls through our targeted gene expression analysis. Collectively, the disease network followed here confers interaction of microbial NHAA with host KYNU, that is further correlated to 396 proteins, 5 markers, 66 symptoms/diseases, 78 pathways and 7 drugs. Docking analysis of drugs used to suppress the effect of IBD and ReA predicts methotrexate as an important drug that could be useful for early treatment of IBD co-evolved ReA. www.nature.com/scientificreports/ Genetic factors found common in both ReA and IBD are HLA-B27, HLA-B51 and HLA-DRB1. The most important mechanism of susceptibility of HLA in ReA is molecular mimicry that is microbial peptides mimicking HLA autopeptides of human host leading to autoimmunity. This mechanism has been observed in ReA where reports have predicted microorganism peptides such as chlamydial proteins (ClpC, NQRA and DNAP) and Yersinia pseudotuberculosis peptides (YopH) showing homology with human HLA-B27 via bioinformatic analysis 14 . Similarly, molecular mimicry has also been observed in IBD cases having extraintestinal manifestations. We performed targeted molecular mimicry analysis in our study using our robust microbial protein (NHAA) with HLA-B27, HLA-B51 and HLA-DRB1, enhancing the importance of NHAA acting as a trigger for generating IBD associated ReA. We generate a putative hypothesis amalgamating key findings with literature. We state that the initial hostmicrobe triggers for IBD associated ReA is when pathogenic microbial protein NHAA interacts with host protein KYNU that further interacts with human proteins ADA, SOD2, CAT and ACE and carbon metabolism involving the above host proteins is hampered. Methotrexate regulates carbon metabolism and the associated host-microbe proteins reducing effect of IBD associated ReA. Since carbon metabolism is the most basic aspect of life and therefore an extensive network consisting of sub-pathways, we narrowed down our findings towards a consequentially central and a significant pathway that embrace the carbon metabolism pathway involving the molecular signatures KYNU, ADA, SOD2, CAT and ACE, further is also effectuated by potential drug methotrexate and is associated with IBD/ ReA/ IBD and ReA cohorts. It is reported that methotrexate is incorporated intracellularly interfering with adenosine concentrations and affecting proinflammatory cytokines in IBD reducing inflammation 93 . In inflammatory arthritis, the mechanisms reported by which methotrexate reduces inflammation include enhanced adenosine release, de novo synthesis of purines and pyrimidines, inhibition of transmethylation reactions, diminished accumulation of polyamines and nitric oxide synthase uncoupling. Most of the mechanisms are associated with folate biosynthesis, a type of carbon metabolism 94 . KYNU, ADA, SOD2, CAT and ACE are also found to be involved in folate biosynthesis and metabolism from GeneCards. Apart from the above targets, parallel interactors, pathways and drugs for IBD co-evolved ReA obtained in our host-microbe disease network can be utilised further as disease determinants. The experimental validation of these targets in patient cohorts need to be performed on a pilot scale in future to increase the robustness of this network. The intertwined information processed through the knowledge-base created for the linked disorders have given the most elaborate layout of patterns observed in disease diagnosis and analysis. The major information after processing the gene expression profiles, protein markers, molecular networks and metabolic networks involved here have led to chalk out as well as connect the strings for robust gut microbiome paradigm shifts. The current work on host-microbe interactions provides a starting point for researchers and clinicians to investigate Inflammatory Bowel Disease (IBD) associated Reactive Arthritis (ReA). In this study a combinatorial approach is utilised to reveal the interactions of gut microbes with human host extensively sketched through the work-pipeline providing the vital insights for the drug targets, biomarkers, pathways and inhibitors for etiology, prognosis, diagnosis and treatment attributes of pathogenic rheumatic autoimmunity. The information sorted through the combinatorial study will be useful in deciphering the etiopathogenesis of the co-linked disorders especially for the rare ReA, from synonymous analyses of IBD datasets, conferred through common microbial triggers. These predictions substantially furnish the intricate details of the cross-talk between post-infectious inflammatory reactions with shared patho-immunogenesis as the starting point for researchers and clinicians for detailed and newer experimental analysis. Future studies are required on larger cohort of patients having ReA due to IBD in order to have validated outputs of the predictive network. www.nature.com/scientificreports/ Reactive arthritis: a review Management of arthritis in patients with inflammatory bowel disease Enteric pathogens and reactive arthritis: a systematic review of Campylobacter, Salmonella and Shigella-associated reactive arthritis Vedolizumab as induction and maintenance therapy for ulcerative colitis Achieving deep remission in Crohn's disease: treating beyond symptoms Gut microbiota perturbations in reactive arthritis and postinfectious spondyloarthritis Epidemiology: time to revisit the concept of reactive arthritis Role of human leukocyte antigens (HLA) in autoimmune diseases Human leukocyte antigen (HLA) and immune regulation: how do classical and non-classical HLA alleles modulate immune response to human immunodeficiency virus and hepatitis C virus infections? Clostridium difficile: an under-recognized cause of reactive arthritis? Reactive arthritis after enteric infections in the United States: the problem of definition MHC class I and class II genes in Tunisian patients with reactive and undifferentiated arthritis Reiter's syndrome associated with HLAB51 Novel HLA-B27-restricted epitopes from chlamydia trachomatis generated upon endogenous processing of bacterial proteins suggest a role of molecular mimicry in reactive arthritis Mechanisms of disease: pathogenesis of Crohn's disease and ulcerative colitis Th1-type responses mediate spontaneous ileitis in a novel murine model of Crohn's disease Lack of TNFR p55 results in heightened expression of IFN-γ and IL-17 during the development of reactive arthritis Microbial antigens mediate HLA-B27 diseases via TLRs Microbes in gastrointestinal health and disease Arthritis associated with Yersinia enterocolitica infection Chlamydia pneumoniae-a new causative agent of reactive arthritis and undifferentiated oligoarthritis Diagnosis of Chlamydia trachomatis in patients with reactive arthritis and undifferentiated spondyloarthropathy Salmonella lipopolysaccharide in synovial cells from patients with reactive arthritis The role of intracellular organisms in the pathogenesis of inflammatory arthritis A Pilot Study for detection of intra-articular chromosomal and extra chromosomal genes of chlamydia trachomatis among genitourinary reactive arthritis patients in India Acute erosive reactive arthritis associated with Campylobacter jejuni-induced colitis Seroprevalence of campylobacteriosis and relevant post-infectious sequelae The role of microbiome in rheumatoid arthritis treatment Immunopathogenesis of rheumatoid arthritis Recombinant Salmonella typhimurium outer membrane protein A is recognized by synovial fluid CD8 cells and stimulates synovial fluid mononuclear cells to produce interleukin (IL)-17/IL-23 in patients with reactive arthritis and undifferentiated spondyloarthropathy Outer membrane protein of salmonella is the major antigenic target in patients with salmonella induced reactive arthritis A single nonamer from the Yersinia 60-kDa heat shock protein is the target of HLA-B27-restricted CTL response in Yersinia-induced reactive arthritis The 19 kDa protein of Yersinia enterocolitica O: 3 is recognized on the cellular and humoral level by patients with Yersinia induced reactive arthritis Identification of the Yersinia enterocolitica urease beta subunit as a target antigen for human synovial T lymphocytes in reactive arthritis Association between reactive arthritis and antecedent infection with shigella flexneri carrying a 2-md plasmid and encoding an hla-b27 mimetic epitope Role of 30 kDa antigen of enteric bacterial pathogens as a possible arthritogenic factor in post-dysenteric reactive arthritis The microbiome in autoimmune diseases Antivirulence activity of the human gut metabolome The human gut microbiome-a potential controller of wellness and disease Prevalence of antibodies against Chlamydia trachomatis and incidence of C. trachomatis-induced reactive arthritis in an early arthritis series in Finland in 2000 Campylobacter reactive arthritis: a systematic review Reactive arthritis: current perspectives The microbiome and autoimmunity: a paradigm from the gut-liver axis Aspects of gut microbiota and immune system interactions in infectious diseases Systematic review of gut microbiota and major depression Omic' technologies: proteomics and metabolomics learning objectives: ethical issues In vivo and in silico determination of essential genes of Campylobacter jejuni A community-driven global reconstruction of human metabolism Systems analysis of inflammatory bowel disease based on comprehensive gene information Introduction of inflammatory bowel disease biomarkers panel using protein-protein interaction (PPI) network analysis GEMSiRV: a software platform for GEnome-scale metabolic model simulation, reconstruction and visualization ExPASy: the proteomics server for in-depth protein knowledge and analysis A systematic reconstruction and constraint-based analysis of Leishmania donovani metabolic network: identification of potential antileishmanial drug targets Genome-scale metabolic reconstructions of Bifidobacterium adolescentis L2-32 and Faecalibacterium prausnitzii A2-165 and their interaction Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0 Reconstruction of the metabolic network of Pseudomonas aeruginosa to interrogate virulence factor synthesis PATRIC, the bacterial bioinformatics database and analysis resource Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences HPIDB-a unified resource for host-pathogen interactions Study of intra-inter species protein-protein interactions for potential drug targets identification and subsequent drug design for Escherichia coli O104:H4 C277-11 Integration of clinical data with a genome-scale metabolic model of the human adipocyte STRING v10: protein-protein interaction networks, integrated over the tree of life Oral squamous cell cancer protein-protein interaction network interpretation in comparison to esophageal adenocarcinoma Association of inflammatory bowel disease with arthritis: evidence from in silico gene expression patterns and network topological analysis The landscape of protein biomarkers proposed for periodontal disease: markers with functional meaning Significant modules and biological processes between active components of Salvia miltiorrhiza depside salt and aspirin Gene ontology and KEGG pathway enrichment analysis of a drug target-based classification system Flexible ligand docking with Glide Identification of novel drug targets against Campylobacter jejuni using metabolic network analysis Cytoscape: a software environment for integrated models of biomolecular interaction networks Multiple groups of endogenous epsilon-like retroviruses conserved across primates update of the EULAR recommendations for the management of early arthritis HLA-B27 Correlates with the intracellular elimination, replication, and trafficking of Salmonella enteritidis collected from reactive arthritis patients Recombinant Salmonella typhimurium outer membrane protein A and D reactive T cells are expanded in synovial fluid of patients with reactive arthritis and undifferentiated spondyloarthropathy (HUM6P. 251) Purification of RNA using TRIzol (TRI reagent) Unique transcriptome signatures and GM-CSF expression in lymphocytes from patients with spondyloarthritis Development of a reverse transcription-quantitative PCR system for detection and genotyping of Aichi viruses in clinical and environmental samples Characterization of the Kynurenine pathway and quinolinic acid production in macaque macrophages Persistence of gene expression changes in noninflamed and inflamed colonic mucosa in ulcerative colitis and their presence in colonic carcinoma Vitamin D receptor expression in dogs Relative expression software tool (REST) for group-wise comparison and statistical analysis of relative expression results in real-time PCR Anti-microbial antibodies, host immunity, and autoimmune disease Host-microbe interactions in the pathogenesis and clinical course of sarcoidosis Elucidating vital drug targets of Salmonella enterica utilizing the bioinformatic approach Overproduction and purification of a functional Na+/H+ antiporter coded by nhaA (ant) from Escherichia coli Pro-inflammatory miR-223 mediates the cross-talk between the IL23 pathway and the intestinal barrier in inflammatory bowel disease Pediatric Crohn disease patients exhibit specific ileal transcriptome and microbiome signature Disruption of macrophage pro-inflammatory cytokine release in Crohn's disease is associated with reduced optineurin expression in a subset of patients Sensitivity and specificity of adenosine deaminase in diagnosis of juvenile idiopathic arthritis Antioxidant enzyme levels in reactive arthritis and rheumatoid polyarthritis Cytochrome P450 1A1 and manganese superoxide dismutase genes polymorphisms in reactive arthritis Coronavirus disease 2019 (COVID-19): do angiotensin-converting enzyme inhibitors/angiotensin receptor blockers have a biphasic effect The current role of methotrexate in patients with inflammatory bowel disease Methotrexate and its mechanisms of action in inflammatory arthritis The authors are grateful to Amity Institute of Biotechnology, Amity University Uttar Pradesh, Noida and Department of Biotechnology, TERI School of Advanced Studies, New Delhi for providing the facility and technical support during the preparation of the manuscript. We also thank Fortis Hospital, Noida for providing the patient samples. S.S. and B.R. conceived the study concept; S.S., B.R. and P.S. jointly designed and supervised the work; B.D.P. supervised the clinical setting and recruitment of participants; B.D.P. and A.V. recruited the participants and contributed to the sample collection and preparation; A.V. performed the experiments; S.S., B.R., P.S. and A.V. contributed to the analysis and interpretation of data; A.V. generated all figures and tables; A.V. wrote the first draft of the manuscript; S.S., B.R., P.S. and B.D.P. critically reviewed and edited the manuscript; All authors reviewed and approved the final version of the manuscript. The authors declare no competing interests. Supplementary information is available for this paper at https ://doi.org/10.1038/s4159 8-020-71674 -8.Correspondence and requests for materials should be addressed to S.S.Reprints and permissions information is available at www.nature.com/reprints.Publisher's note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creat iveco mmons .org/licen ses/by/4.0/.