key: cord-0854119-j633s3kc authors: Klaassen, Kristel; Stankovic, Biljana; Zukic, Branka; Kotur, Nikola; Gasic, Vladimir; Pavlovic, Sonja; Stojiljkovic, Maja title: Functional prediction and comparative population analysis of variants in genes for proteases and innate immunity related to SARS-CoV-2 infection date: 2020-08-07 journal: Infect Genet Evol DOI: 10.1016/j.meegid.2020.104498 sha: 07c9bdff7953ff79873aef2f1845be9f0bcdf49a doc_id: 854119 cord_uid: j633s3kc New coronavirus SARS-CoV-2 is capable to infect humans and cause a novel disease COVID-19. Aiming to understand a host genetic component of COVID-19, we focused on variants in genes encoding proteases and genes involved in innate immunity that could be important for susceptibility and resistance to SARS-CoV-2 infection. Analysis of sequence data of coding regions of FURIN, PLG, PRSS1, TMPRSS11a, MBL2 and OAS1 genes in 143 unrelated individuals from Serbian population identified 22 variants with potential functional effect. In silico analyses (PolyPhen-2, SIFT, MutPred2 and Swiss-Pdb Viewer) predicted that 10 variants could impact the structure and/or function of proteins. These protein-altering variants (p.Gly146Ser in FURIN; p.Arg261His and p.Ala494Val in PLG; p.Asn54Lys in PRSS1; p.Arg52Cys, p.Gly54Asp and p.Gly57Glu in MBL2; p.Arg47Gln, p.Ile99Val and p.Arg130His in OAS1) may have predictive value for inter-individual differences in the response to the SARS-CoV-2 infection. Next, we performed comparative population analysis for the same variants using extracted data from the 1000 genomes project. Population genetic variability was assessed using delta MAF and Fst statistics. Our study pointed to 7 variants in PLG, TMPRSS11a, MBL2 and OAS1 genes with noticeable divergence in allelic frequencies between populations worldwide. Three of them, all in MBL2 gene, were predicted to be damaging, making them the most promising population-specific markers related to SARS-CoV-2 infection. Comparing allelic frequencies between Serbian and other populations, we found that the highest level of genetic divergence related to selected loci was observed with African, followed by East Asian, Central and South American and South Asian populations. When compared with European populations, the highest divergence was observed with Italian population. In conclusion, we identified 4 variants in genes encoding proteases (FURIN, PLG and PRSS1) and 6 in genes involved in the innate immunity (MBL2 and OAS1) that might be relevant for the host response to SARS-CoV-2 infection. A new coronavirus SARS-CoV-2 capable to infect humans, emerged in mid-December 2019 in Wuhan, China, causing the novel disease COVID-19 (Wu et al., 2020) . Clinical manifestations of this viral infection vary from asymptomatic to severe acute respiratory syndrome and death. The number of infected people and affected countries have quickly risen and already in March 2020, World Health Organization declared a pandemic (WHO). Therefore, it became highly important to understand if specific human genes and genetic variants could be associated with susceptibility or resistance to SARS-CoV-2 infection and how frequencies of these variants vary between different populations. Human genes usually associated with susceptibility and resistance to viral infection are those associated with the point of viral entry into the human host cells, such as genes encoding receptors, co-receptors and enzymes that modify receptors (Kenney et al., 2017) . Furthermore, various genes involved in the immune response, such as virus sensing, signaling in response to virus, antiviral factors etc. have also been found to be important for the severity and the outcome of viral infections. A recent study which analyzed COVID-19 symptoms in monozygotic and dizygotic twins, reported that 50 (95% confidence intervals 29-70)% of the variance of 'predicted COVID-19' phenotype is due to genetic factors (Williams et al., 2020) . Deciphering the RNA sequence of the SARS-CoV-2 genome showed that this new virus belongs to lineage B betacoronaviruses, together with the SARS-CoV virus which emerged in 2002 (Wu et al., 2020) . Genetic susceptibility and resistance to SARS-CoV had been extensively studied by genotyping SARS patients with extremely severe and extremely mild clinical manifestations. As J o u r n a l P r e -p r o o f Journal Pre-proof a result, variants in OAS1, MX1, MBL2, CCL2, CCL5, ASHG, IFNgamma, CD14 and CD209 genes were associated with genetic susceptibility to SARS-CoV (Chan et al., 2010; Chong et al., 2006; He et al., 2006; Ng et al., 2007; Tu et al., 2015; Yuan et al., 2007; Zhu et al., 2011) . Majority of variants that emerged in such studies were located in non-coding parts of the human genome and their effect is observable in the fine tuning of the gene expression. Also, variants residing in coding regions, such as rs1800450 in MBL2 gene (Tu et al., 2015) were particularly interesting as they alter protein structure and function. SARS-CoV and SARS-CoV-2 share ~76% amino acid sequence identity of the spike (S) protein sequence, a crucial part of the viral envelope which enables specific binding to the receptors at human cells, therefore contributing to viral potential to infect humans (Chan et al., 2020; Lu et al., 2015) . It was shown that SARS-CoV-2 binds to the human angiotensin-converting enzyme 2 (ACE2) receptors, as SARS-CoV does, however with the higher affinity (Wrapp et al., 2020) . Human transmembrane protease serine 2 (TMPRSS2), an enzyme important for the entry of SARS-CoV (Matsuyama et al., 2010) , was recently found to activate the S protein of the SARS-CoV-2 (Hoffmann et al., 2020) . Moreover, it was found that the S protein of the SARS-CoV-2 contains a furin-like cleavage site which is absent in coronaviruses of the same clade (Coutard et al., 2020) . Knowing that furin cleavage sites are responsible for the high virulence of human influenza viruses (Chen et al., 1998) , it was suggested that furin-like site in the S protein of SARS-CoV-2 represents its advantage in attaching to the human cells expressing ACE2 receptor. The SARS-CoV-2 relies on the host cell proteases to cut its S protein in two parts thus forming N-terminal part which recognizes ACE2 receptor and C-terminal part involved in the viral entry which must be further cleaved by furin and/or other furin-like enzymes (Coutard et al., 2020) . It J o u r n a l P r e -p r o o f Journal Pre-proof has been shown that plasmin is also capable to cleave furin sites and that individuals with elevated plasmin demonstrated higher susceptibility to COVID-19 . In addition to plasmin, S protein of coronaviruses may be cleaved by other airway proteases such as trypsin-1 and TMPRSS11a . At the beginning of May 2020, in the full swing of COVID-19 pandemic, the data on genetic susceptibility or resistance to SARS-CoV-2 which would be based on the genotyping of individuals infected by SARS-CoV-2 are still lacking. However, a detailed study using computational prediction methods for protein structure analyses revealed that 17 variants in the coding regions of the ACE2 gene are located at positions important for the binding of ACE2 with the SARS-CoV-2 S protein (Hussain et al., 2020) . Based on these predictions, individuals carrying these variants would probably be resistant to SARS-CoV-2 infection. These variants were rare (less than 0.00388 allele frequency) and their association with human diseases or disorders has never been reported (Hussain et al., 2020) . Having in mind the rarity of variants that could directly influence the binding of SARS-CoV-2 to ACE2, it is not surprising that comparative genetic analysis of the frequency of ACE2 variants in different populations did not predict the existence of individuals resistant to SARS-CoV-2 infection (Cao et al., 2020) . ACE2 is not the only possible player that could influence the interaction between humans and SARS-CoV-2 virus. The furin could be the second important factor to contribute to high pathogenicity of the novel virus. Furthermore, we can learn from clinical experiences of the SARS-CoV-2 infection which showed that plasmin and other airway proteases could also be important for patients experiencing more severe form of COVID-19 . In addition, the knowledge based on SARS epidemic shows that MBL2 and OAS1 genes, involved in the innate immune response, could also modulate susceptibility to infection with betacoronaviruses. J o u r n a l P r e -p r o o f Taking all these players into account, we performed genetic analysis of variants in coding regions of FURIN, plasminogen (PLG), trypsin-1 (PRSS1), TMPRSS11a, MBL2 and OAS1 genes in Serbian populations aiming to identify possible genetic markers that are capable to impact protein structure/function and thus contribute to the susceptibility or resistance to SARS-CoV-2 infection. We postulate that the variants in genes encoding proteases could be advantageous, while variants in genes encoding proteins involved in the innate immunity add some disadvantage to individuals in combating COVID-19. We also performed comparative genetic analysis in different populations in order to assess the divergence between populations for those variants. In this study, we analyzed the genomic sequence data of unrelated individuals from Serbian population, extracted from the in-house database of Laboratory for Molecular Biomedicine, Institute of Molecular Genetics and Genetic Engineering, University of Belgrade. Written informed consent was obtained from all participants. The study was conducted in accordance with the Helsinki Declaration and approved by the Ethics Committee of Institute of Molecular Genetics and Genetic Engineering, University of Belgrade. Total of 143 unrelated Serbian individuals (84 males and 59 females) were previously analyzed by NGS approach using the Illumina Clinical Exome Sequencing TruSight One Gene Panel J o u r n a l P r e -p r o o f Journal Pre-proof (Illumina, San Diego, CA, USA) , as previously described (Skakic et al., 2018) . VCF files were further annotated and examined using the Illumina VariantStudio 3.0 Data Analysis Software (Illumina, San Diego, CA, USA). Variants that did not pass variant call quality filters (those with quality score 8) in more than 10% of samples were excluded. Variants that were considered for further analysis were either missense, start loss, stop gain, splice region variants and frameshift, while synonymous variants were omitted. For all selected variants, Hardy-Weinberg equilibrium was tested, using the chi square goodness of fit test. regarding the 1kGP data were described previously (Genomes Project et al., 2015) . Fisher exact test was used to measure significant differences in genotypes distributions between Serbian and 1kGP populations, applying Bonferoni correction for multiple testing. We examined the level of population genetic variability at each selected locus using: (1) the maximal global differences in minor allele frequencies (delta MAF) calculated by subtracting the maximum and the minimum MAF across analyzed population groups, (2) using Fst statistics (Nei, 1987) , which is widely used in population genetics (Holsinger and Weir, 2009 missense variants, 1 start lost, 1 missense/splice region and 1 splice region variant. Majority of the detected variants were rare (8 variants had minor allele frequency of 0.3%, while 4 variants of 0.7%). We detected the highest minor allelic frequency for the OAS1 p.Gly162Ser, being 41%. All detected variants were in Hardy-Weinberg equilibrium. Next, we performed in silico prediction analysis of the identified variants' effect, using PolyPhen-2, SIFT and MutPred2 algorithms ( Other 2 variants with potentially damaging effect, p.Arg52Cys and p.Gly54Glu in MBL2 gene, having allelic frequency of 8% and 14%, respectively, are the most promising populationspecific markers to be considered for association study in COVID-19 patients in Serbia. In total, 4 variants in genes encoding proteases (FURIN, PLG and PRSS1) and 6 in genes involved in the innate immunity (MBL2 and OAS1) might be interesting for further studies in relation to the response to SARS-CoV-2 infection. While mutations in the RNA of betacoronavirus opened the possibility to jump to a new specie, variants in various human genes may contribute to increased susceptibility to a new pathogen, or could have a protective role. One of the most illustrious examples are variants in CD4 receptor which contribute to susceptibility to some strains of HIV infection, while variants in CCR5 gene provide resistance to its carriers (Marmor et al., 2006; Oyugi et al., 2009 ). Recent studies addressed rare variants in ACE2 gene which could directly influence the binding to S protein of the SARS-CoV-2 and the population-specific differences of more frequent ACE2 gene variants (Cao et al., 2020; Hussain et al., 2020) . However, having in mind complexity of the interaction between the virus and the host, which besides the S protein -ACE2 receptor interaction, includes the role of different host proteases and at least some elements of innate immunity with antiviral activity, in this study we selected additional genes that could be important for the susceptibility or resistance to the SARS-CoV-2 viral infection. We focused on genetic variants in genes FURIN, PLG, PRSS1, TMPRSS11a, that encode proteases, as well as J o u r n a l P r e -p r o o f variants in genes MBL2 and OAS1, that encode proteins involved in innate immunity, attempting to find promising candidate alleles that might be included in the future genetic test related to new waives of the SARS-CoV-2 pandemics. To systematically examine the coding variants in selected genes and differences in the allele frequency between populations, we performed comparative analysis of 22 variants found in FURIN, PLG, PRSS1, TMPRSS11a, MBL2 and OAS1 genes from Serbian population database with different European populations and super-populations extracted from 1kGP database (Genomes Project et al., 2015) . Also, we used bioinformatic tools to predict the effect of these genetic variants on the structure and/or function of proteins encoded by FURIN, PLG, PRSS1, TMPRSS11a, MBL2 and OAS1 genes. It could be possible that the existence of variants in genes encoding proteases provide advantage while variants in genes for proteins involved in the innate immunity add some disadvantage to individuals in combating COVID-19. Host proteases, such as furin, plasmin, trypsin-1 and TMPRSS11a, are crucial in the process of cutting the S protein of the SARS-CoV and SARS-CoV-2 envelope at the S1/S2 cleavage site, which is a necessary event needed to release the S1 fragment (N-terminal part) and the S2 fragment (C-terminal part) (Coutard et al., 2020) . The S1 fragment recognizes ACE2 receptor at the surface of the human cells, and the S2 fragment is involved in the viral entry into the cells . Furthermore, the spike protein of the SARS-CoV-2 must be cleaved at both, S1/S2 cleavage site and at the furin-like cleavage site inside the S2 fragment in order to J o u r n a l P r e -p r o o f enable viral entry (Coutard et al., 2020) . It was previously shown that the cleavage by host proteases plasmin, trypsin-1 and TMPRSS11a at both sites of the S protein is mandatory for the entry of SARS-CoV into human bronchial epithelial cells in vitro (Kam et al., 2009 ). Interestingly, the furin like S2' cleavage site is identical between SARS-CoV and SARS-CoV-2 (Coutard et al., 2020) thus implying the capability of furin, plasmin, trypsin-1 and TMPRSS11a to perform activation steps and enable viral entry into the human cells. The efficiency of all these four proteases contribute to regulation of cellular tropism and determination of viral pathogenesis. Thus, genetic variants that would impact the structure and/or function of furin, plasmin, trypsin-1 and TMPRSS11 may lead to inter-individual differences in the response to the SARS-CoV-2 infection. FURIN encodes a type 1 membrane bound protease, one of the seven basic amino acid-specific members which cleave their substrates at single or paired basic residues. First, furin is autocatalytically processed in the endoplasmatic reticulum and then transported to the trans-Golgi network where a second autocatalytic event takes place and the catalytic activity is acquired. In addition to having affinity for various substrates in human tissues, it is probably one of the proteases responsible for the activation of SARS-CoV-2 envelope S glycoproteins (Coutard et al., 2020) . Two rare variants, p.Thr33Ala and p.Gly146Ser, were detected in FURIN. Thr33 residue is located in the pro-domain of a peptidase and therefore cleaved off in the process of activation of the furin enzyme, whereas Gly146 residue is located within the peptidase domain of the enzyme (Dahms et al., 2016) . A substitution of Gly146 to Ser is predicted to be probably Plasminogen is a protease which belongs to peptidase family S1 and it is encoded by the PLG gene. Proteolysis of plasminogen results in multiple forms of the active plasmin. Plasminogen is present in the human lung tissues, more precisely in the airway, alveolar type I and II epithelial cells as well as in endothelial cells . Six very rare variants were identified in the (Law et al., 2012) . Given that the Pan-Apple domains mediate protein-protein or protein-carbohydrate interactions, whereas Kringle domains play a role in binding mediators, these variants may be involved in fine tuning of plasmin activity. Having in mind that individuals with elevated plasmin had greater susceptibility to COVID-19 and more severe clinical manifestations , rare variants in PLG, such as p.Arg261His and p.Ala494Val may be recognized as potential markers of interindividual differences in susceptibility to coronavirus. Trypsin-1 is encoded by the PRSS1 gene which is a member of the trypsin-1 family of serine proteases. It is active on peptide linkages involving the carboxyl group of lysine or arginine. This J o u r n a l P r e -p r o o f enzyme is secreted by the pancreas and cleaved to its active form in the small intestine. It is also present in airway and alveolar type I and II epithelial cells . Two rare variants in the PRSS1 gene, c.592-8C>T and p.Asn54Lys, were detected. Variant c.592-8C>T was previously detected in patients with cystic fibrosis presenting with chronic pancreatitis (Sofia et al., 2018) . Given that this is a splice region variant, it may affect splicing and therefore the level of trypsin-1 protein. Trypsin-1 variant p.Asn54Lys leads to the substitution of polar asparagine residue to a positively charged lysine. This variant is predicted to be possibly damaging/deleterious by PolyPhen-2 and SIFT algorithms, while MutPred2 software suggested that this change may alter the catalytic site. Given that the catalytic triad of trypsin-1 enzyme is consisted of His57, Asn102 and Ser195 (Polgar, 2005) , the proximity of Asn54 to His57 might modify the enzymatic activity. The same codon is affected by an alternative variant, p.Asn54Ser, which was classified pathogenic by UniProt, and associated with chronic pancreatitis (Richards et al., 2015; Teich et al., 2005) . Nevertheless, having in mind that the variant p.Asn54Lys was observed in healthy adults, this variant is not regarded as a disease causing in the context of Mendelian diseases. Its mechanism might be through the fine modification of trypsin-1 enzymatic activity, which could contribute to the resistance of some individuals to SARS-CoV-2 infection. Transmembrane serine protease 11A, encoded by the TMPRSS11a gene is one of the members of type II transmembrane serine proteases and it is present in upper respiratory tract (pharynx and trachea) and digestive tract (Kam et al., 2009) . Two rare variants were detected in the TMPRSS11A gene: p.Lys48Arg and p.Arg328Gln. For both variants, PolyPhen-2, SIFT and MutPred2 algorithms predict benign/tolerated effect. Given the scarce data on the protein structure of TMPRSS11A, the effect of this variants is yet to be ascertained. Innate immunity is important in defending organism from viral infections. After SARS-CoV outbreak it was found that type I interferons could inhibit the replication of this virus and that these interferons further induce different proteins with antiviral activity (Cinatl et al., 2003) . One of such proteins is encoded by the OAS1 gene. OAS1 synthesizes 2',5'-oligoadenylates, and as a consequence activates latent RNase L which cuts single-stranded RNAs thus leading to the viral RNA degradation and inhibition of viral replication (Rebouillat and Hovanessian, 1999) . Three rare variants were detected in the OAS1 gene: p.Arg47Gln, p.Ile99Val and p.Arg130His. Interestingly, human OAS1 protein recognizes two adjacent minor grooves of dsRNA with Cterminal lobe and N-terminal lobe, where the Arg47 residue is one of the residues involved in the recognition (Donovan et al., 2013) . When analyzed in Swiss-Pdb Viewer, the side chain of Arg47 residue was predicted to form hydrogen bond with dsRNA, while a change to glutamine at this position was predicted to abolish this hydrogen bond. Furthermore, Gln at this position is predicted to form hydrogen bonds with Cys45 and Phe46, and to cause steric clashes with Glu43 and Arg44. Nevertheless, variant p.Arg47Gln is predicted to be benign/tolerated by PolyPhen-2, SIFT and MutPred2 algorithms, so its effect on the OAS1 protein seems not to affect the structure itself, but it may be apparent upon the binding of the dsRNA. Variant p.Ile99Val is predicted to be possibly damaging/deleterious by prediction algorithms, but the analysis in Swiss-Pdb Viewer revealed that the backbone of the Ile99 residue is predicted to form hydrogen bonds with Arg95, Gly96 and Arg103, so the change to Val at this position did not change the J o u r n a l P r e -p r o o f existing bonds, nor did it form novel ones. Variant p.Arg130His is predicted to be probably damaging/deleterious by prediction algorithms, and the analysis in Swiss-Pdb Viewer showed that in case of Arg130, a change to His at this position was predicted to result in the formation of novel hydrogen bonds with Thr188 and also a steric clash with Glu185. These rare variants may have an effect on the OAS1 structure and/or function through the formation of novel hydrogen bonds and steric clashes, leading to a less stable protein. In case of variants located at the protein/RNA interface formed upon the binding, its effects reflect on the formation of (to some extent) weaker bond with the RNA thus lowering its 2'-5'-oligoadenylate synthetase activity, as it was shown that variants at the protein/RNA interface impair OAS1 activity by 60-to 2,500-fold (Donovan et al., 2013) . Furthermore, having in mind that the active form of human OAS1 is tetrameric, any variant potentially impairing the oligomerization process would also effect on the activity of the enzyme, the synthesis of 2',5'-oligoadenylates, and consequently the activation of the latent RNase and the degradation of viral RNA of SARS-CoV-2. Knowing that variants in OAS1, such as p.Gly397Arg were previously associated with susceptibility to SARS-CoV infection in some populations (He et al., 2006) , the effects of p.Arg47Gln, p.Ile99Val and p.Arg130His might be interesting for further investigation. The comparative population analysis pointed to the 7 coding variants in PLG, TMPRSS11a, MBL2 and OAS1 with noticeable divergence in allelic frequencies between analyzed populations. Three variants were found in genes encoding proteases. Variant p.Asp472Asn in PLG gene showed notable MAF discrepancy between European and Asian populations, being the highest in Spain (0.34) and the lowest in East Asians (0.0). This substitution of neutral to acidic amino acid arises in a loop region that connects two Kringle domains of plasminogen and it was presumed to have a functional effect through altering the alignment of Kringle domains (Zaas et al., 2008) . This variant was shown previously to influence susceptibility to invasive aspergillosis (Zaas et al., 2008) . Having in mind the MAF difference between Spanish and East Asian populations, it would be interesting to further investigate whether it also reflects the remarkable differences in susceptibility and morbidity to COVID-19 between these populations. Two variants in (Fedetz et al., 2006) , but a more J o u r n a l P r e -p r o o f recent study showed that this variant was found not to interfere with OAS1 enzyme activity (Kjaer et al., 2014) . Another important element of the innate immune system is mannose-binding protein (soluble mannose-binding lectin) which is encoded by the MBL2 gene. The protein recognizes mannose and N-acetylglucosamine expressed on the surface of many microorganisms, and is capable of activating the classical complement pathway. Deficiencies of this gene have been associated with increased susceptibility to SARS-CoV and other autoimmune and infectious diseases (Tu et al., 2015) . Interestingly, out of 7 genetic variants that demonstrated delta MAF>0.1 in our study, only p.Arg52Cys, p.Gly54Asp and p.Gly57Glu in MBL2 were predicted to be probably damaging/deleterious by prediction algorithms. Moreover, these three variants were already extensively studied and functionally characterized -they showed compromised oligomerization and thus the activity of the final protein (Larsen et al., 2004) . In addition to promoter variants, these coding variants have been shown to influence the stability and serum concentration of the protein, where, consequently, low levels of MBL2 have been associated with increased susceptibility to infections (Garred et al., 2006) . Therefore, the influence of these three variants was studied in various immunological sceneries, from autoimmune to infections, including SARS-CoV (Ip et al., 2005; Tu et al., 2015) . Nevertheless, all three variants are found in all Taken altogether, comparing Serbian to other populations, it was found that the highest level of genetic differentiation related to selected loci was observed with African, followed by East Asian and South Asian populations. When compared with European populations, the highest divergence was observed with Italian population. In conclusion, results of our analysis showed that variants predicted to have altering effect to the proteins are very rare in each of the selected European populations as well as superpopulations. Thus, although it is not likely to perform massive genetic testing aiming to detect these variants in order to predict prognosis to COVID-19, they may provide answers for the inter-individual differences in the clinical course of disease among patients of the same age and the same genetic background which received identical medical treatment. On the other hand, variants which have divergent allele frequencies between populations were mostly predicted to lack effect on the structure and/or the function of the proteins. However, few variants, such as those in MBL2 gene were predicted to have some functional effect and their contribution to population differences regarding COVID-19 should be further evaluated. In general, findings of this study may lead us to conclude that 4 coding variants in genes encoding proteases (FURIN, PLG and PRSS1) and 6 in genes involved in the innate immunity J o u r n a l P r e -p r o o f Fisher exact test was used to measure significant differences in genotypes distributions between Serbian and 1kGP populations. Test was two tailed and p value was considered significant at 0.0007 after Bonferoni correction for multiple testing. Comparative genetic analysis of the novel coronavirus (2019-nCoV/SARS-CoV-2) receptor ACE2 in different populations Genomic characterization of the 2019 novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting Wuhan. Emerging microbes & infections 9 Association of a single nucleotide polymorphism in the CD209 (DC-SIGN) promoter with SARS severity Structure of the hemagglutinin precursor cleavage site, a determinant of influenza pathogenicity and the origin of the labile conformation The interferon gamma gene polymorphism +874 A/T is associated with severe acute respiratory syndrome Treatment of SARS with human interferons The spike glycoprotein of the new coronavirus 2019-nCoV contains a furin-like cleavage site absent in CoV of the same clade Structural basis for cytosolic double-stranded RNA surveillance by human oligoadenylate synthetase 1 OAS1 gene haplotype confers susceptibility to multiple sclerosis A global reference for human genetic variation Association of SARS susceptibility with single nucleic acid polymorphisms of OAS1 and MxA genes: a case-control study SARS-CoV-2 Cell Entry Depends on ACE2 and TMPRSS2 and Is Blocked by a Clinically Proven Protease Inhibitor Genetics in geographically structured populations: defining, estimating and interpreting F(ST) Structural variations in human ACE2 may influence its binding with SARS-CoV-2 spike protein Mannose-binding lectin in severe acute respiratory syndrome coronavirus infection Elevated Plasmin(ogen) as a Common Risk Factor for COVID-19 Susceptibility Human Genetic Determinants of Viral Diseases Disease-associated mutations in human mannose-binding lectin compromise oligomerization and activity of the final protein The X-ray crystal structure of full-length human plasminogen Bat-to-human: spike features determining 'host jump' of coronaviruses SARS-CoV, MERS-CoV, and beyond Reassessment of the type I diabetes association of the OAS1 locus The human 2',5'-oligoadenylate synthetase family: interferoninduced proteins with unique enzymatic properties Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology Trans-heterozygosity for mutations enhances the risk of recurrent/chronic pancreatitis in patients with Cystic Fibrosis Differential Cumulative Risk of Genetic Polymorphisms in Familial and Nonfamilial Esophageal Squamous Cell Carcinoma. Cancer epidemiology, biomarkers & prevention : a publication of the Gene conversion between functional trypsinogen genes PRSS1 and PRSS2 associated with chronic pancreatitis in a six-year-old girl Functional polymorphisms of the CCL2 and MBL genes cumulatively increase susceptibility to severe acute respiratory syndrome coronavirus infection Receptor Recognition by the Novel Coronavirus from Wuhan: an Analysis Based on Decade-Long Structural Studies of SARS Coronavirus Cryo-EM structure of the 2019-nCoV spike in the prefusion conformation High prevalence of the CD14-159CC genotype in patients infected with severe acute respiratory syndrome-associated coronavirus Plasminogen alleles influence susceptibility to invasive aspergillosis Plasmin improves oedematous blood-gas barrier by cleaving epithelial sodium channels Genetic variation of the human alpha-2-Heremans-Schmid glycoprotein (AHSG) gene associated with the risk of SARS-CoV infection Writing -review & editing. Biljana Stankovic: Data curation, Formal analysis, Investigation, Writingoriginal draft, Writing -review & editing. Branka Zukic: Project administration, Supervision, Writing ± review & editing. Nikola Kotur: Data curation, Investigation, Validation, Writingreview & editing. Vladimir Gasic: Data curation, Investigation, Validation, Writing -review & editing. Sonja Pavlovic: Conceptualization, Project administration, Supervision, Writingoriginal draft, Writing -review & editing. Maja Stojiljkovic: Conceptualization, Formal analysis J o u r n a l P r e -p r o o f