key: cord-0906428-pxezk012 authors: Wang, Qiong; Qiu, Ye; Li, Jin-Yan; Zhou, Zhi-Jian; Liao, Ce-Heng; Ge, Xing-Yi title: A Unique Protease Cleavage Site Predicted in the Spike Protein of the Novel Pneumonia Coronavirus (2019-nCoV) Potentially Related to Viral Transmissibility date: 2020-03-20 journal: Virol Sin DOI: 10.1007/s12250-020-00212-7 sha: a5729379cb2d18730da32cf117283be2cc16cf05 doc_id: 906428 cord_uid: pxezk012 nan In December 2019, a novel human coronavirus caused an epidemic of severe pneumonia (Coronavirus Disease 2019, in Wuhan, Hubei, China Zhu et al. 2020) . So far, this virus has spread to all areas of China and even to other countries. The epidemic has caused 67,102 confirmed infections with 1526 fatal cases worldwide by February 14th, 2020. The viral infection incubation period varies from 2 to 14 days and typical clinical symptoms are fever, dry cough, dyspnea, headache, and pneumonia. Disease onset may result in progressive respiratory failure due to alveolar damage and even death (Chan et al. 2020; Chen et al. 2020; Huang et al. 2020) . The nomenclature for this coronavirus is still controversial. It was initially named as 2019-nCoV, indicating that it is a novel coronavirus identified in the year of 2019. Recently, the International Committee on Taxonomy of Viruses (ICTV) has suggested to name this virus as severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) based on its phylogenetic relationship with SARS-CoV. However, many researchers point out that the name SARS-CoV-2 implies this virus as a pathogen of SARS, but actually, COVID-19 is a disease different from SARS, and the name SARS-CoV-2 may mislead the treatment and prevention of COVID-19. Hence, a new name, human coronavirus 2019 (HCoV-19), has been suggested, which is more appropriate and consistent with the disease name of COVID-19 . Nevertheless, to avoid potential confusions, we use 2019-nCoV to indicate the new coronavirus in this letter, mimicking most articles published recently. Coronaviruses are enveloped non-segmented positive sense RNA viruses belonging to the genus Betacoronavirus of the subfamily Orthocoronavirinae in the family Coronaviridae. Based on the phylogenetic tree, 2019-nCoV is clustered into the Sarbecovirus subgenus with other severe acute respiratory syndrome-related coronaviruses (SARSr-CoVs), such as SARS-CoV and bat SARSr-CoV . Compared to other human coronaviruses, 2019-nCoV emerges several unique features. First, the mortality of 2019-nCoV infection is estimated to be 3.06% which is much lower than that of SARS-CoV (10%) and MERS-CoV (37%). Second, SARS-CoV-2 has a basic reproduction rate (R 0 ) of 3.3-5.5, higher than those of SARS-CoV (2-5) and MERS-CoV (2.7-3.9), indicating a higher transmissibility of SARS-CoV-2 than other human coronaviruses (Lipsitch et al. 2003; Wallinga 2004; Lin et al. 2018) . The underlying mechanism determining these features is still unclear and more relevant studies are urged considering the serious 2019-nCoV epidemic recently. Receptor recognition is an important factor determining host range and cross-species infection of viruses. For coronaviruses, their receptor binding and subsequent internalization mainly depend on the spike protein (S protein) anchored in the viral envelope. All coronavirus S proteins consist of three domains: an extracellular domain (EC), a transmembrane anchor domain and a short intracellular tail. EC contains two functional subunits, a receptor-binding subunit (S1) and a membrane-fusion subunit (S2). S1 contains two independent domains, an N-terminal domain (S1-NTD) and receptor binding domain (RBD), which plays a key role in receptor recognition and binding (Heald-Sargent and Gallagher 2012; Li 2012). During host-virus membrane fusion, spikes protein is cleaved at the S1/S2 boundary by host proteases, releasing Qiong Wang and Ye Qiu have authors contributed equally to this work. the spike fusion peptide, which is necessary for virus entry. The host proteases for S protein cleavage vary among different coronaviruses, which is a key factor determining the epidemiological and pathological features of virus, including host range, tissue tropism, transmissibility and mortality. For instance, a variety of human proteases, such as trypsin, tryptase Clara, human airway trypsin-like protease (HAT) and transmembrane protease serine 2 (TMPRSS2), are known to cleave and activate the S protein of SARS-CoV (Bosch et al. 2008; Bertram et al. 2011 ). These proteases are widely expressed in many important organs, which is critical reason for the systematic infection, serious pathogenicity and high mortality of SARS-CoV. Laboratory studies also demonstrated that the IBV Beaudette strain, compared to other IBV strains, have an extended tropism, which infects not only primary chicken cells but also many other cell lines, due to containing an additional consensus furin cleavage site on its S protein (Yamada and Liu 2009) . In this study, we discovered one deletion and three insertions in 2019-nCoV S protein by amino acid sequence alignment. Notably, four additional amino acid residues (-PRRA-) were inserted between S1 and S2 subunits, potentially affecting the cleavage of S protein as our hypothesis (Fig. 1A) . To verify our speculation, we comprehensively predicted the protease cleavage sites on different coronavirus S protein by using the ProP 1.0 server (www.cbs.dtu.dk/services/ProP/). This server is designed to predict arginine and lysine propeptide cleavage sites in eukaryotic protein sequences by using an ensemble of neural networks. As a result, 2019-nCoV S protein showed a unique furin cleavage (-RRAR-) within the S1/S2 domain which was overlapped with insertion described above (Fig. 1B) . This furin cleavage site was located between the residues 682 and 685, distinct from SARS-CoV and all other SARS-like coronaviruses which only contain a trypsin or TMPRSS2 cleavage site at R667 (corresponding to residues 685 in 2019-nCoV S) (Fig. 1B) . Furin is a protease ubiquitously expressed in a variety of organs and tissues, including brain, lung, gastrointestinal tract, liver, pancreas and reproductive tissues. With the furin cleavage site on the S protein, 2019-nCoV probably gains ability to infect organs or tissues insensitive to other coronaviruses, leading to systematic infection of 2019-nCoV in the body. Even worse, the wide distribution of 2019-nCoV in a patient body may release the virus into the environment via more diverse ways, severely enhancing the transmission of 2019-nCoV. This hypothesis is supported by the current reports about the trace of 2019-nCoV in some place distinct from other coronaviruses, such as feces and eyes. However, these speculations are mainly based on our sequence studies, and further functional studies are required to characterize how these differences affect the functionality and pathogenesis of 2019-nCoV. In summary, our sequence analysis on the S protein of 2019-nCoV has predicted a novel furin cleavage site at S1/S2 linkage. The ubiquitous expression of furin in different organs and tissues may be a reason for the high transmissibility and pathogenicity of 2019-nCoV observed in the current epidemic. However, since our findings were mainly based on bioinformatic analysis, more laboratory studies on 2019-nCoV in cell and animal models are required to verify our speculations and to avoid any bias. Cleavage and activation of the severe acute respiratory syndrome coronavirus spike protein by human airway trypsin-like protease Cathepsin L functionally cleaves the severe acute respiratory syndrome coronavirus class I fusion protein upstream of rather than adjacent to the fusion peptide A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster Epidemiological and clinical characteristics of 99 cases of 2019 novel coronavirus pneumonia in Wuhan, China: a descriptive study Ready, set, fuse! The coronavirus spike protein and acquisition of fusion competence Clinical features of patients infected with 2019 novel coronavirus in Wuhan A distinct name is needed for the new coronavirus Evidence for a common evolutionary origin of coronavirus spike protein receptor-binding subunits Modeling the spread of Middle East respiratory syndrome coronavirus in Saudi Arabia Transmission dynamics and control of severe acute respiratory syndrome Different epidemic curves for severe acute respiratory syndrome reveal similar impacts of control measures A new coronavirus associated with human respiratory disease in China Proteolytic activation of the spike protein at a novel RRRR/S motif is implicated in furin-dependent entry, syncytium formation, and infectivity of coronavirus infectious bronchitis virus in cultured cells China Novel Coronavirus I, Research T (2020) A novel coronavirus from patients with pneumonia in China Acknowledgements This work was jointly funded by the National