key: cord-0923325-hbystii6 authors: de Moraes, Diogo; Paiva, Brunno Vivone Buquete; Cury, Sarah Santiloni; Junior, João Pessoa Araújo; da Silva Mori, Marcelo Alves; Carvalho, Robson Francisco title: Prediction of SARS-CoV interaction with host proteins during lung aging reveals a potential role for TRIB3 in COVID-19 date: 2020-04-09 journal: bioRxiv DOI: 10.1101/2020.04.07.030767 sha: ceec895af54ccfb9f7a4954a63a96dfcd6b987c6 doc_id: 923325 cord_uid: hbystii6 COVID-19 is prevalent in the elderly. Old individuals are more likely to develop pneumonia and respiratory failure due to alveolar damage, suggesting that lung senescence may increase the susceptibility to SARS-CoV-2 infection and replication. Considering that human coronavirus (HCoVs; SARS-CoV-2 and SARS-CoV) require host cellular factors for infection and replication, we analyzed Genotype-Tissue Expression (GTEx) data to test whether lung aging is associated with transcriptional changes in human protein-coding genes that potentially interact with these viruses. We found decreased expression of the gene tribbles homolog 3 (TRIB3) during aging in male individuals, and its protein was predicted to interact with HCoVs nucleocapsid protein and RNA-dependent RNA polymerase. Using publicly available lung single-cell data, we found TRIB3 expressed mainly in alveolar epithelial cells that express SARS-CoV-2 receptor ACE2. Functional enrichment analysis of age-related genes, in common with SARS-CoV-induced perturbations, revealed genes associated with the mitotic cell cycle and surfactant metabolism. Given that TRIB3 was previously reported to decrease virus infection and replication, the decreased expression of TRIB3 in aged lungs may help explain why older male patients are related to more severe cases of the COVID-19. Thus, drugs that stimulate TRIB3 expression should be evaluated as a potential therapy for the disease. The first cases of infections with the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) in humans were identified in December 2019 in Wuhan, China 1,2 , and since then, the coronavirus disease 2019 (COVID-19) rapidly became pandemic 3 . Studies have shown that older individuals with comorbidities are associated with more severe cases of COVID-19 4 . These patients are more likely to develop pneumonia and respiratory failure due to alveolar damage 5, 6 , suggesting that lung aging impacts disease progression and mortality. SARS-CoV-2 requires host cellular factors for successful infection and replication 7 . For example, angiotensin-converting enzyme 2 (ACE2) is the receptor for the SARS-CoV-2 spike protein receptor-binding domain (RBD) for viral attachment 7, 8 . The conserved evolutionary relationship between the 2019 novel SARS-CoV-2 and SARS-CoV 9 opens up the possibility to explore the relationships between these human coronaviruses (HCoVs) in public databases. Computational predictions of SARS-CoV-human protein-protein interactions (PPIs) may identify mechanisms of viral infection and drug targets [9] [10] [11] . In this context, the Genotype-Tissue Expression (GTEx) database 12, 13 has provided insights into age-related genes 14, 15 and, associated with single-cell transcriptomics, could predict SARS-CoV-2-PPIs in aging lungs. This characterization is crucial for older adults, which are more vulnerable to the disease [16] [17] [18] . Thus, we analyzed whether lung aging is associated with transcriptional changes in proteins that potentially interact with SARS-CoV-2. We first identified differentially expressed genes (DEGs) during aging in GTEx human lung samples (release V7) (Data S1). The numbers of significant DEGs increased with aging (Log foldchange ≥ |1| and FDR < 0.05), and individuals of 60-69 year old (yo) presented the highest number of DEGs, in comparison to young adults (20-20 yo) (Figures S1-S2, Table S1 ). Clustering of these DEGs identified age-associated profiles ( Figure 1a ). Among the transcripts translated into proteins predicted as interacting with SARS-CoV, the hyaluronan and proteoglycan link protein 2 (HAPLN2) increased with aging, while tribbles homolog 3 (TRIB3) decreased ( Figure 1b , Table S2 ). HAPLN2 was predicted to interact with virus proteins spike glycoprotein and E2 glycoprotein precursors, while TRIB3 with nucleocapsid protein and RNA-dependent RNA polymerase ( Figure 1b ; Table S3 ). Importantly, the SARS-CoV-2 nucleocapsid protein has a sequence identity of 89.6% compared to SARS-CoV 9 . The expression of TRIB3 also decreased in the lung, specifically in males with ≥ 40 years (Figure 1c) , in a cohort with additional samples (GTEx, release V8). The reanalysis of lung single-cell RNA sequencing data 19, 20 demonstrated TRIB3 expressed mainly in alveolar type I (AT1) and type II (AT2) cells and in ciliated cells (Figure 1d -f, Figure S3 ), which also express SARS-CoV-2 receptor ACE2 7, 8, 21 . The involvement of TRIB3 in viral infection is poorly understood; however, its inhibition was associated with an increase of hepatitis C virus (HCV) replication 22 . Additionally, TRIB3 negatively regulates the entry step of the HCV life cycle and propagation 22 Table S8 ). We found that genes that decrease their expression with aging and genes that are up-regulated with SARS-CoV infections generated the most significant network, with overrepresented genes associated with mitotic cell cycle and surfactant metabolism ( Figure 2 ). The decreased capacity of cellular division on aging is associated with cellular senescence -a mechanism that stops cells with damaged DNA from replicating 26 -and progenitor cell exhaustion 27 . The altered metabolism or secretion of surfactants by AT2 cells reduces the ability of the lungs to expand and increases the risk of lung collapse in HCoVs infections 28, 29 . Moreover, Sftpc -/-(Surfactant Protein C) mice have worse viral infections than controls 30 , and its human homolog decreased with aging while it is up-regulated on SARS-CoV infections ( Figure 2 ). Thus, the pneumonia-like lung injury found on severe cases of COVID-19 infections 5,6 may be aggravated by impaired lung regeneration and altered metabolism of surfactants in older male patients. Although the genes and pathways we highlighted here were identified based on a robust statistical significance, we emphasize that other methods of over-time gene expression analyses applying different cutoffs could be considered; using GTEx V8 cohort or separating males and females may result in different sets of age-related genes in the lung. Further analyses should be conducted to identify more functional differences between male and female lungs during aging. Moreover, clinical data from these individuals -such as diabetes or heart disease -important factors influencing COVID-19 outcome -were not evaluated. However, due to the nature of GTEx donor consent, public phenotypes (including clinical data) are limited. Access to the protected phenotypes of the individuals needs an application via dbGaP (Genotypes and Phenotypes database), which, associated with reanalysis of the transcriptomics data, may take a significant amount of time. Part of the results presented herein derives from a previously unpublished paper focusing on aging lung on a different topic. Nevertheless, we decided to release this data focusing on SARS-CoV-2, due to the emergency of the current pandemic. In conclusion, we show that lung gene expression of TRIB3, a protein predicted to interact with the to MAM). The results shown here are, in part, based upon data generated by the Genotype-Tissue Expression project (GTEx) (https://gtexportal.org/). The RNA-Seq data analysis of lung tissues was performed using 427 samples from males and females available at the GTEx portal (release V7) (https://www.gtexportal.org/) 31 . We first used the BioJupies platform (https://amp.pharm.mssm.edu/biojupies/) 32 to identify the differentially expressed genes (DEG) in lung samples during aging. The expression data of each subject was distributed according to their age range: 30-39; 40-49; 50-59; 60-69 and 70-79 years old (yo), and each age range was compared with the group of young adults (20-29 yo) as a common control (Data S1, Supplementary Table S9 . Genes with Log2 of fold change ≥ |1| and false discovery rate ( FDR) < 0.05 were considered as differentially expressed (DEGs). Next, we performed a hierarchical clustering analysis based on Pearson correlation for age ranges using the mean expression (Trimmed Mean of M-values, TMM) of the DEGs found previously in, at least, one age range. This analysis aimed at the identification of clusters with gradients of gene expression across age ranges, showing increased or decreased expression during aging (method adapted from Theunissen et al., 2011 33 ). The clusterization of gene expression profiles on female lung samples generated clusters with less evident gradients compared to the males (Figure 1a and Figure S4 ). Thus, considering the clear clusterization found in males, and the fact that older males seem to have a worse prognosis on COVID-19 6 , we focused on male profiles to identify age-related lung genes. Our research group previously used the strategy described above to analyze RNA-Seq data of GTEx lung samples (release V7) during aging. We reutilized these results due to the urgency of the current pandemic situation (see limitations section). With the availability of the last release of the GTEx dataset (release V8), which presents a higher number of lung samples with transcriptomic data from individuals of both sexes (Table S10) , we used this last available cohort (V8) for further genespecific analyses. The expression of HAPLN2 and TRIB3 (TMM normalized; V8 cohort) were compared using One-way ANOVA followed by Dunnett's multiple comparisons test, with the GraphPad Prism version 6.00 for Windows (GraphPad Software, La Jolla, California, USA). Differences with a P-value < 0.05 were considered significant. The conserved evolutionary relationship between the 2019 novel SARS-CoV-2 and SARS-CoV 9 opens up the possibility to explore the relationships of these human coronaviruses (HCoVs) in publicly available databases. Thus, we compared the list of DEG in the lung during aging with corresponding human proteins that potentially interact with HCoVs (Table S2 ). The HCoVs-human PPIs were obtained using data from the Pathogen-Host Interactome Prediction using Structure Similarity (P-HIPSTer, http://phipster.org/) database, which is a comprehensive catalog of the virushuman PPIs predicted based on protein structural information 11 (Table S2) . P-HIPSTer has an experimental validation rate of ∼ 76% 11 . We also compared the list of DEG with recently added libraries for virus perturbations (up-and down-regulation) from GEO datasets (GSE33266, GSE50000, GSE49262, GSE50878, GSE49263, GSE40824, GSE50878, GSE49263, GSE47960, GSE47961, GSE47962, GSE17400, and GSE40824), available at the EnrichR database 34 . Access in March 2020. Genes that are common in either direction of both conditions were analyzed on STRING (https://string-db.org/) 35 . Access in March 2020. We analyzed the expression of TRIB3, HAPLN2, and ACE2, in different lung cell populations by using two previously published human single-cell RNA-seq data (Table S4) 19, 20 . The first dataset 20 was explored in the UCSC Cell Browser (http://nupulmonary.org/resources/), aiming the identification of the cell populations expressing those genes. The samples with pulmonary fibrosis presented in this dataset were omitted from our analysis, and only non-diseased lung samples were included (n=8). We further used an independent single-cell RNA-seq dataset 19 (n=5), available at the Human Cell Atlas Portal (https://data.humancellatlas.org/explore/projects/c4077b3c-5c98-4d26-a614-246d12c2e5d7), to confirm that expression TRIB3 and ACE2 are expressed in alveolar epithelial cells (types 1 and 2) and in ciliate cells. Access in March 2020. The corresponding proteins of the DEG shared with the list of DEG from libraries for virus perturbations were queried in the STRING database (Search Tool for Retrieval of Interacting Genes, version 10.5; https://string-db.org/) 35 , for the construction of PPI networks. We considered the following settings: text mining, experiments, databases, and co-expression as sources of active interaction. We selected the minimum interaction score of 0.900 (highest confidence), and the disconnected nodes were hidden to simplify the display. We evaluated the PPI enrichment P-values, which verifies the number of interactions of a set of proteins compared with a random set of similar size. The PPI enrichment P-value represents the statistical significance provided by STRING. Access in March 2020. The clustering analyses of the expression profiles were performed using the web tool Morpheus (https://software.broadinstitute.org/morpheus) 36 . Venn diagrams were plotted using the Jvenn online tool (https://jvenn.toulouse.inra.fr) 37 . Volcano Plots were constructed using the web tool: https://paolo.shinyapps.io/ShinyVolcanoPlot/. All data is available in the manuscript. human samples in a t-distributed Stochastic Neighbor Embedding (tSNE) plot, as described previously 20 . Grey dots represent single-cells from pulmonary fibrosis samples that were not included in the present analysis. Single-cell gene expression of TRIB3 (B), HAPLN2 (C), and ACE2 (D) in different cell populations of the lung. The images were generated using the dataset 20 , available at nupulmonary.org/resources/. The range represents the minimum and maximum expression. Figure S3 . Heatmap with the mean expression of DEGs on female samples (n=129), normalized by the Trimmed Mean of M-values (TMM) and Z-scored by row. Fewer genes clustered on gradients compared with males (Figure 1) , which could mean a more substantial noise on the data due to decreased sample size or female hormonal cycle. A new coronavirus associated with human respiratory disease in China A pneumonia outbreak associated with a new coronavirus of probable bat origin An interactive web-based dashboard to track COVID-19 in real time Clinical characteristics of 140 patients infected with SARS CoV 2 in Wuhan Time Course of Lung Changes On Chest CT During Recovery From 2019 Novel Coronavirus (COVID-19) Pneumonia Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus Network-based drug repurposing for novel coronavirus 2019-nCoV/SARS-CoV-2 Understanding Human-Virus Protein-Protein Interactions Using a Human Protein Complex-Based Analysis Framework. mSystems 4 A Structure-Informed Atlas of Human-Virus Interactions The Genotype-Tissue Expression (GTEx) project The Genotype-Tissue Expression (GTEx) pilot analysis: Multitissue gene regulation in humans. Science (80-. ) Transcriptome analysis reveals the difference between "healthy" and "common" aging and their connection with age related diseases An analysis of aging-related genes derived from the Genotype-Tissue Expression project (GTEx) Updated understanding of the outbreak of 2019 novel coronavirus (2019 nCoV) in Wuhan Early Transmission Dynamics in Wuhan, China, of Novel Coronavirus-Infected Pneumonia Clinical features of patients infected with 2019 novel coronavirus in Wuhan Lung, spleen and oesophagus tissue remains stable for scRNAseq in cold preservation Single-cell transcriptomic analysis of human lung provides insights into the pathobiology of pulmonary fibrosis SARS-CoV-2 Cell Entry Depends on ACE2 and TMPRSS2 and is Blocked by a Clinically Proven Protease Inhibitor Nonstructural 3 Protein of Hepatitis C Virus Modulates the Tribbles Homolog 3/Akt Signaling Pathway for Persistent Viral Infection Single-cell transcriptional dynamics of flavivirus infection Therapeutic potential of the new TRIB3-mediated cell autophagy anticancer drug ABTL0812 in endometrial cancer Focusing on the Unfolded Protein Response and Autophagy Related Pathways to Reposition Common Approved Drugs against COVID-19 Senescence and aging: Causes, consequences, and therapeutic avenues XThe hallmarks of aging Surfactant alteration and replacement in acute respiratory distress syndrome Molecular pathology of emerging coronavirus infections Surfactant protein C-deficient mice are susceptible to respiratory syncytial virus infection A scaling normalization method for differential expression analysis of RNA-seq data Automated Generation of Interactive