id sid tid token lemma pos cord-260054-iihgc5nr 1 1 key key NN cord-260054-iihgc5nr 1 2 : : : cord-260054-iihgc5nr 1 3 cord-260054-iihgc5nr cord-260054-iihgc5nr . cord-260054-iihgc5nr 2 1 authors author NNS cord-260054-iihgc5nr 2 2 : : : cord-260054-iihgc5nr 2 3 Cavallo Cavallo NNP cord-260054-iihgc5nr 2 4 , , , cord-260054-iihgc5nr 2 5 Luigi Luigi NNP cord-260054-iihgc5nr 2 6 ; ; : cord-260054-iihgc5nr 2 7 Oliva Oliva NNP cord-260054-iihgc5nr 2 8 , , , cord-260054-iihgc5nr 2 9 Romina Romina NNP cord-260054-iihgc5nr 2 10 title title NN cord-260054-iihgc5nr 2 11 : : : cord-260054-iihgc5nr 2 12 D936Y D936Y NNP cord-260054-iihgc5nr 2 13 and and CC cord-260054-iihgc5nr 2 14 Other Other NNP cord-260054-iihgc5nr 2 15 Mutations Mutations NNPS cord-260054-iihgc5nr 2 16 in in IN cord-260054-iihgc5nr 2 17 the the DT cord-260054-iihgc5nr 2 18 Fusion Fusion NNP cord-260054-iihgc5nr 2 19 Core Core NNP cord-260054-iihgc5nr 2 20 of of IN cord-260054-iihgc5nr 2 21 the the DT cord-260054-iihgc5nr 2 22 SARS SARS NNP cord-260054-iihgc5nr 2 23 - - HYPH cord-260054-iihgc5nr 2 24 Cov-2 Cov-2 NNP cord-260054-iihgc5nr 2 25 Spike Spike NNP cord-260054-iihgc5nr 2 26 Protein Protein NNP cord-260054-iihgc5nr 2 27 Heptad Heptad NNP cord-260054-iihgc5nr 2 28 Repeat repeat NN cord-260054-iihgc5nr 2 29 1 1 CD cord-260054-iihgc5nr 2 30 Undermine undermine VBP cord-260054-iihgc5nr 2 31 the the DT cord-260054-iihgc5nr 2 32 Post Post NNP cord-260054-iihgc5nr 2 33 - - NNP cord-260054-iihgc5nr 2 34 Fusion Fusion NNP cord-260054-iihgc5nr 2 35 Assembly Assembly NNP cord-260054-iihgc5nr 2 36 date date NN cord-260054-iihgc5nr 3 1 : : : cord-260054-iihgc5nr 3 2 2020 2020 CD cord-260054-iihgc5nr 3 3 - - HYPH cord-260054-iihgc5nr 3 4 06 06 CD cord-260054-iihgc5nr 3 5 - - HYPH cord-260054-iihgc5nr 3 6 08 08 CD cord-260054-iihgc5nr 3 7 journal journal NN cord-260054-iihgc5nr 3 8 : : : cord-260054-iihgc5nr 4 1 bioRxiv biorxiv IN cord-260054-iihgc5nr 4 2 DOI DOI NNP cord-260054-iihgc5nr 4 3 : : : 
cord-260054-iihgc5nr 5 1 10.1101/2020.06.08.140152 10.1101/2020.06.08.140152 CD cord-260054-iihgc5nr 5 2 sha sha NNP cord-260054-iihgc5nr 5 3 : : : cord-260054-iihgc5nr 6 1 7bec72c240a061d6182d0c823d19efbae5014583 7bec72c240a061d6182d0c823d19efbae5014583 NNP cord-260054-iihgc5nr 6 2 doc_id doc_id CD cord-260054-iihgc5nr 6 3 : : : cord-260054-iihgc5nr 6 4 260054 260054 CD cord-260054-iihgc5nr 6 5 cord_uid cord_uid NNS cord-260054-iihgc5nr 6 6 : : : cord-260054-iihgc5nr 7 1 iihgc5nr iihgc5nr NNP cord-260054-iihgc5nr 8 1 The the DT cord-260054-iihgc5nr 8 2 iconic iconic JJ cord-260054-iihgc5nr 8 3 “ " `` cord-260054-iihgc5nr 8 4 red red JJ cord-260054-iihgc5nr 8 5 crown crown NN cord-260054-iihgc5nr 8 6 ” " '' cord-260054-iihgc5nr 8 7 of of IN cord-260054-iihgc5nr 8 8 the the DT cord-260054-iihgc5nr 8 9 severe severe JJ cord-260054-iihgc5nr 8 10 acute acute JJ cord-260054-iihgc5nr 8 11 respiratory respiratory JJ cord-260054-iihgc5nr 8 12 syndrome syndrome NN cord-260054-iihgc5nr 8 13 coronavirus coronavirus NN cord-260054-iihgc5nr 8 14 2 2 CD cord-260054-iihgc5nr 9 1 ( ( -LRB- cord-260054-iihgc5nr 9 2 SARS SARS NNP cord-260054-iihgc5nr 9 3 - - HYPH cord-260054-iihgc5nr 9 4 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 9 5 ) ) -RRB- cord-260054-iihgc5nr 9 6 is be VBZ cord-260054-iihgc5nr 9 7 made make VBN cord-260054-iihgc5nr 9 8 of of IN cord-260054-iihgc5nr 9 9 its -PRON- PRP$ cord-260054-iihgc5nr 9 10 spike spike NN cord-260054-iihgc5nr 9 11 ( ( -LRB- cord-260054-iihgc5nr 9 12 S s NN cord-260054-iihgc5nr 9 13 ) ) -RRB- cord-260054-iihgc5nr 9 14 glycoprotein glycoprotein NN cord-260054-iihgc5nr 9 15 . . . 
cord-260054-iihgc5nr 10 1 The the DT cord-260054-iihgc5nr 10 2 S S NNP cord-260054-iihgc5nr 10 3 protein protein NN cord-260054-iihgc5nr 10 4 is be VBZ cord-260054-iihgc5nr 10 5 the the DT cord-260054-iihgc5nr 10 6 Trojan Trojan NNP cord-260054-iihgc5nr 10 7 horse horse NN cord-260054-iihgc5nr 10 8 of of IN cord-260054-iihgc5nr 10 9 coronaviruses coronaviruse NNS cord-260054-iihgc5nr 10 10 , , , cord-260054-iihgc5nr 10 11 mediating mediate VBG cord-260054-iihgc5nr 10 12 their -PRON- PRP$ cord-260054-iihgc5nr 10 13 entry entry NN cord-260054-iihgc5nr 10 14 into into IN cord-260054-iihgc5nr 10 15 the the DT cord-260054-iihgc5nr 10 16 host host NN cord-260054-iihgc5nr 10 17 cells cell NNS cord-260054-iihgc5nr 10 18 . . . cord-260054-iihgc5nr 11 1 While while IN cord-260054-iihgc5nr 11 2 SARS SARS NNP cord-260054-iihgc5nr 11 3 - - HYPH cord-260054-iihgc5nr 11 4 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 11 5 was be VBD cord-260054-iihgc5nr 11 6 becoming become VBG cord-260054-iihgc5nr 11 7 a a DT cord-260054-iihgc5nr 11 8 global global JJ cord-260054-iihgc5nr 11 9 threat threat NN cord-260054-iihgc5nr 11 10 , , , cord-260054-iihgc5nr 11 11 scientists scientist NNS cord-260054-iihgc5nr 11 12 have have VBP cord-260054-iihgc5nr 11 13 been be VBN cord-260054-iihgc5nr 11 14 accumulating accumulate VBG cord-260054-iihgc5nr 11 15 data datum NNS cord-260054-iihgc5nr 11 16 on on IN cord-260054-iihgc5nr 11 17 the the DT cord-260054-iihgc5nr 11 18 virus virus NN cord-260054-iihgc5nr 11 19 at at IN cord-260054-iihgc5nr 11 20 an an DT cord-260054-iihgc5nr 11 21 impressive impressive JJ cord-260054-iihgc5nr 11 22 pace pace NN cord-260054-iihgc5nr 11 23 , , , cord-260054-iihgc5nr 11 24 both both CC cord-260054-iihgc5nr 11 25 in in IN cord-260054-iihgc5nr 11 26 terms term NNS cord-260054-iihgc5nr 11 27 of of IN cord-260054-iihgc5nr 11 28 genomic genomic JJ cord-260054-iihgc5nr 11 29 sequences sequence NNS cord-260054-iihgc5nr 11 30 and and CC cord-260054-iihgc5nr 11 31 of of IN 
cord-260054-iihgc5nr 11 32 three three CD cord-260054-iihgc5nr 11 33 - - HYPH cord-260054-iihgc5nr 11 34 dimensional dimensional JJ cord-260054-iihgc5nr 11 35 structures structure NNS cord-260054-iihgc5nr 11 36 . . . cord-260054-iihgc5nr 12 1 On on IN cord-260054-iihgc5nr 12 2 April April NNP cord-260054-iihgc5nr 12 3 21st 21st NN cord-260054-iihgc5nr 12 4 , , , cord-260054-iihgc5nr 12 5 the the DT cord-260054-iihgc5nr 12 6 GISAID GISAID NNP cord-260054-iihgc5nr 12 7 resource resource NN cord-260054-iihgc5nr 12 8 had have VBD cord-260054-iihgc5nr 12 9 collected collect VBN cord-260054-iihgc5nr 12 10 10,823 10,823 CD cord-260054-iihgc5nr 12 11 SARS SARS NNP cord-260054-iihgc5nr 12 12 - - HYPH cord-260054-iihgc5nr 12 13 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 12 14 genomic genomic JJ cord-260054-iihgc5nr 12 15 sequences sequence NNS cord-260054-iihgc5nr 12 16 . . . cord-260054-iihgc5nr 13 1 We -PRON- PRP cord-260054-iihgc5nr 13 2 extracted extract VBD cord-260054-iihgc5nr 13 3 from from IN cord-260054-iihgc5nr 13 4 them -PRON- PRP cord-260054-iihgc5nr 13 5 all all PDT cord-260054-iihgc5nr 13 6 the the DT cord-260054-iihgc5nr 13 7 complete complete JJ cord-260054-iihgc5nr 13 8 S S NNP cord-260054-iihgc5nr 13 9 protein protein NN cord-260054-iihgc5nr 13 10 sequences sequence NNS cord-260054-iihgc5nr 13 11 and and CC cord-260054-iihgc5nr 13 12 identified identify VBN cord-260054-iihgc5nr 13 13 point point NN cord-260054-iihgc5nr 13 14 mutations mutation NNS cord-260054-iihgc5nr 13 15 thereof thereof RB cord-260054-iihgc5nr 13 16 . . . 
cord-260054-iihgc5nr 14 1 Six six CD cord-260054-iihgc5nr 14 2 mutations mutation NNS cord-260054-iihgc5nr 14 3 were be VBD cord-260054-iihgc5nr 14 4 located locate VBN cord-260054-iihgc5nr 14 5 on on IN cord-260054-iihgc5nr 14 6 a a DT cord-260054-iihgc5nr 14 7 14-residue 14-residue CD cord-260054-iihgc5nr 14 8 segment segment NN cord-260054-iihgc5nr 14 9 ( ( -LRB- cord-260054-iihgc5nr 14 10 929 929 CD cord-260054-iihgc5nr 14 11 - - SYM cord-260054-iihgc5nr 14 12 943 943 CD cord-260054-iihgc5nr 14 13 ) ) -RRB- cord-260054-iihgc5nr 14 14 in in IN cord-260054-iihgc5nr 14 15 the the DT cord-260054-iihgc5nr 14 16 “ " `` cord-260054-iihgc5nr 14 17 fusion fusion NN cord-260054-iihgc5nr 14 18 core core NN cord-260054-iihgc5nr 14 19 ” " '' cord-260054-iihgc5nr 14 20 of of IN cord-260054-iihgc5nr 14 21 the the DT cord-260054-iihgc5nr 14 22 heptad heptad NN cord-260054-iihgc5nr 14 23 repeat repeat NN cord-260054-iihgc5nr 14 24 1 1 CD cord-260054-iihgc5nr 14 25 ( ( -LRB- cord-260054-iihgc5nr 14 26 HR1 HR1 NNP cord-260054-iihgc5nr 14 27 ) ) -RRB- cord-260054-iihgc5nr 14 28 . . . 
cord-260054-iihgc5nr 15 1 Our -PRON- PRP$ cord-260054-iihgc5nr 15 2 modeling modeling NN cord-260054-iihgc5nr 15 3 in in IN cord-260054-iihgc5nr 15 4 the the DT cord-260054-iihgc5nr 15 5 pre- pre- JJ cord-260054-iihgc5nr 15 6 and and CC cord-260054-iihgc5nr 15 7 post post JJ cord-260054-iihgc5nr 15 8 - - JJ cord-260054-iihgc5nr 15 9 fusion fusion JJ cord-260054-iihgc5nr 15 10 S S NNP cord-260054-iihgc5nr 15 11 protein protein NN cord-260054-iihgc5nr 15 12 conformations conformation NNS cord-260054-iihgc5nr 15 13 revealed reveal VBD cord-260054-iihgc5nr 15 14 , , , cord-260054-iihgc5nr 15 15 for for IN cord-260054-iihgc5nr 15 16 three three CD cord-260054-iihgc5nr 15 17 of of IN cord-260054-iihgc5nr 15 18 them -PRON- PRP cord-260054-iihgc5nr 15 19 , , , cord-260054-iihgc5nr 15 20 the the DT cord-260054-iihgc5nr 15 21 loss loss NN cord-260054-iihgc5nr 15 22 of of IN cord-260054-iihgc5nr 15 23 interactions interaction NNS cord-260054-iihgc5nr 15 24 stabilizing stabilize VBG cord-260054-iihgc5nr 15 25 the the DT cord-260054-iihgc5nr 15 26 post post JJ cord-260054-iihgc5nr 15 27 - - JJ cord-260054-iihgc5nr 15 28 fusion fusion JJ cord-260054-iihgc5nr 15 29 assembly assembly NN cord-260054-iihgc5nr 15 30 . . . cord-260054-iihgc5nr 16 1 On on IN cord-260054-iihgc5nr 16 2 May May NNP cord-260054-iihgc5nr 16 3 29th 29th NN cord-260054-iihgc5nr 16 4 , , , cord-260054-iihgc5nr 16 5 the the DT cord-260054-iihgc5nr 16 6 SARS SARS NNP cord-260054-iihgc5nr 16 7 - - HYPH cord-260054-iihgc5nr 16 8 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 16 9 genomic genomic JJ cord-260054-iihgc5nr 16 10 sequences sequence NNS cord-260054-iihgc5nr 16 11 in in IN cord-260054-iihgc5nr 16 12 GISAID GISAID NNP cord-260054-iihgc5nr 16 13 were be VBD cord-260054-iihgc5nr 16 14 34,805 34,805 CD cord-260054-iihgc5nr 16 15 . . . 
cord-260054-iihgc5nr 17 1 An an DT cord-260054-iihgc5nr 17 2 analysis analysis NN cord-260054-iihgc5nr 17 3 of of IN cord-260054-iihgc5nr 17 4 the the DT cord-260054-iihgc5nr 17 5 occurrences occurrence NNS cord-260054-iihgc5nr 17 6 of of IN cord-260054-iihgc5nr 17 7 the the DT cord-260054-iihgc5nr 17 8 HR1 HR1 NNP cord-260054-iihgc5nr 17 9 mutations mutation NNS cord-260054-iihgc5nr 17 10 in in IN cord-260054-iihgc5nr 17 11 this this DT cord-260054-iihgc5nr 17 12 updated update VBN cord-260054-iihgc5nr 17 13 dataset dataset NN cord-260054-iihgc5nr 17 14 revealed reveal VBD cord-260054-iihgc5nr 17 15 a a DT cord-260054-iihgc5nr 17 16 significant significant JJ cord-260054-iihgc5nr 17 17 increase increase NN cord-260054-iihgc5nr 17 18 for for IN cord-260054-iihgc5nr 17 19 the the DT cord-260054-iihgc5nr 17 20 S929I S929I NNP cord-260054-iihgc5nr 17 21 and and CC cord-260054-iihgc5nr 17 22 S939F S939F NNP cord-260054-iihgc5nr 17 23 mutations mutation NNS cord-260054-iihgc5nr 17 24 and and CC cord-260054-iihgc5nr 17 25 a a DT cord-260054-iihgc5nr 17 26 dramatic dramatic JJ cord-260054-iihgc5nr 17 27 increase increase NN cord-260054-iihgc5nr 17 28 for for IN cord-260054-iihgc5nr 17 29 the the DT cord-260054-iihgc5nr 17 30 D936Y d936y JJ cord-260054-iihgc5nr 17 31 mutation mutation NN cord-260054-iihgc5nr 17 32 , , , cord-260054-iihgc5nr 17 33 which which WDT cord-260054-iihgc5nr 17 34 was be VBD cord-260054-iihgc5nr 17 35 particularly particularly RB cord-260054-iihgc5nr 17 36 widespread widespread JJ cord-260054-iihgc5nr 17 37 in in IN cord-260054-iihgc5nr 17 38 Sweden Sweden NNP cord-260054-iihgc5nr 17 39 and and CC cord-260054-iihgc5nr 17 40 Wales Wales NNP cord-260054-iihgc5nr 17 41 / / SYM cord-260054-iihgc5nr 17 42 England England NNP cord-260054-iihgc5nr 17 43 . . . 
cord-260054-iihgc5nr 18 1 We -PRON- PRP cord-260054-iihgc5nr 18 2 notice notice VBP cord-260054-iihgc5nr 18 3 that that IN cord-260054-iihgc5nr 18 4 this this DT cord-260054-iihgc5nr 18 5 is be VBZ cord-260054-iihgc5nr 18 6 also also RB cord-260054-iihgc5nr 18 7 the the DT cord-260054-iihgc5nr 18 8 mutation mutation NN cord-260054-iihgc5nr 18 9 causing cause VBG cord-260054-iihgc5nr 18 10 the the DT cord-260054-iihgc5nr 18 11 loss loss NN cord-260054-iihgc5nr 18 12 of of IN cord-260054-iihgc5nr 18 13 a a DT cord-260054-iihgc5nr 18 14 strong strong JJ cord-260054-iihgc5nr 18 15 inter inter JJ cord-260054-iihgc5nr 18 16 - - JJ cord-260054-iihgc5nr 18 17 monomer monomer JJ cord-260054-iihgc5nr 18 18 interaction interaction NN cord-260054-iihgc5nr 18 19 , , , cord-260054-iihgc5nr 18 20 the the DT cord-260054-iihgc5nr 18 21 D936-R1185 D936-R1185 NNP cord-260054-iihgc5nr 18 22 salt salt NN cord-260054-iihgc5nr 18 23 bridge bridge NN cord-260054-iihgc5nr 18 24 , , , cord-260054-iihgc5nr 18 25 thus thus RB cord-260054-iihgc5nr 18 26 clearly clearly RB cord-260054-iihgc5nr 18 27 weakening weaken VBG cord-260054-iihgc5nr 18 28 the the DT cord-260054-iihgc5nr 18 29 post post JJ cord-260054-iihgc5nr 18 30 - - JJ cord-260054-iihgc5nr 18 31 fusion fusion JJ cord-260054-iihgc5nr 18 32 assembly assembly NN cord-260054-iihgc5nr 18 33 . . . 
cord-260054-iihgc5nr 19 1 Coronavirus Coronavirus NNP cord-260054-iihgc5nr 19 2 Disease Disease NNP cord-260054-iihgc5nr 19 3 2019 2019 CD cord-260054-iihgc5nr 19 4 is be VBZ cord-260054-iihgc5nr 19 5 caused cause VBN cord-260054-iihgc5nr 19 6 by by IN cord-260054-iihgc5nr 19 7 the the DT cord-260054-iihgc5nr 19 8 severe severe JJ cord-260054-iihgc5nr 19 9 acute acute JJ cord-260054-iihgc5nr 19 10 respiratory respiratory JJ cord-260054-iihgc5nr 19 11 syndrome syndrome NN cord-260054-iihgc5nr 19 12 coronavirus coronavirus NN cord-260054-iihgc5nr 19 13 2 2 CD cord-260054-iihgc5nr 20 1 ( ( -LRB- cord-260054-iihgc5nr 20 2 SARS SARS NNP cord-260054-iihgc5nr 20 3 - - HYPH cord-260054-iihgc5nr 20 4 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 20 5 ) ) -RRB- cord-260054-iihgc5nr 20 6 . . . cord-260054-iihgc5nr 21 1 SARS SARS NNP cord-260054-iihgc5nr 21 2 - - HYPH cord-260054-iihgc5nr 21 3 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 21 4 is be VBZ cord-260054-iihgc5nr 21 5 a a DT cord-260054-iihgc5nr 21 6 novel novel JJ cord-260054-iihgc5nr 21 7 virus virus NN cord-260054-iihgc5nr 21 8 belonging belong VBG cord-260054-iihgc5nr 21 9 to to IN cord-260054-iihgc5nr 21 10 the the DT cord-260054-iihgc5nr 21 11 β β XX cord-260054-iihgc5nr 21 12 genus genus NN cord-260054-iihgc5nr 21 13 coronaviruses coronaviruses NNP cord-260054-iihgc5nr 21 14 , , , cord-260054-iihgc5nr 21 15 which which WDT cord-260054-iihgc5nr 21 16 also also RB cord-260054-iihgc5nr 21 17 include include VBP cord-260054-iihgc5nr 21 18 two two CD cord-260054-iihgc5nr 21 19 highly highly RB cord-260054-iihgc5nr 21 20 pathogenic pathogenic JJ cord-260054-iihgc5nr 21 21 human human JJ cord-260054-iihgc5nr 21 22 viruses virus NNS cord-260054-iihgc5nr 21 23 identified identify VBN cord-260054-iihgc5nr 21 24 in in IN cord-260054-iihgc5nr 21 25 the the DT cord-260054-iihgc5nr 21 26 last last JJ cord-260054-iihgc5nr 21 27 two two CD cord-260054-iihgc5nr 21 28 decades decade NNS cord-260054-iihgc5nr 21 29 , , , cord-260054-iihgc5nr 21 30 the the DT
cord-260054-iihgc5nr 21 31 severe severe JJ cord-260054-iihgc5nr 21 32 acute acute JJ cord-260054-iihgc5nr 21 33 respiratory respiratory JJ cord-260054-iihgc5nr 21 34 syndrome syndrome NN cord-260054-iihgc5nr 21 35 coronavirus coronavirus NN cord-260054-iihgc5nr 21 36 ( ( -LRB- cord-260054-iihgc5nr 21 37 SARS SARS NNP cord-260054-iihgc5nr 21 38 - - HYPH cord-260054-iihgc5nr 21 39 CoV CoV NNP cord-260054-iihgc5nr 21 40 ) ) -RRB- cord-260054-iihgc5nr 21 41 and and CC cord-260054-iihgc5nr 21 42 the the DT cord-260054-iihgc5nr 21 43 Middle Middle NNP cord-260054-iihgc5nr 21 44 East East NNP cord-260054-iihgc5nr 21 45 respiratory respiratory JJ cord-260054-iihgc5nr 21 46 syndrome syndrome NN cord-260054-iihgc5nr 21 47 coronavirus coronavirus NN cord-260054-iihgc5nr 21 48 ( ( -LRB- cord-260054-iihgc5nr 21 49 MERS MERS NNP cord-260054-iihgc5nr 21 50 - - HYPH cord-260054-iihgc5nr 21 51 CoV CoV NNP cord-260054-iihgc5nr 21 52 ) ) -RRB- cord-260054-iihgc5nr 21 53 ( ( -LRB- cord-260054-iihgc5nr 21 54 1 1 LS cord-260054-iihgc5nr 21 55 ) ) -RRB- cord-260054-iihgc5nr 21 56 ( ( -LRB- cord-260054-iihgc5nr 21 57 2 2 LS cord-260054-iihgc5nr 21 58 ) ) -RRB- cord-260054-iihgc5nr 21 59 ( ( -LRB- cord-260054-iihgc5nr 21 60 3 3 CD cord-260054-iihgc5nr 21 61 ) ) -RRB- cord-260054-iihgc5nr 21 62 . . .
cord-260054-iihgc5nr 22 1 Coronaviruses coronaviruse NNS cord-260054-iihgc5nr 22 2 are be VBP cord-260054-iihgc5nr 22 3 named name VBN cord-260054-iihgc5nr 22 4 after after IN cord-260054-iihgc5nr 22 5 the the DT cord-260054-iihgc5nr 22 6 protruding protrude VBG cord-260054-iihgc5nr 22 7 spike spike NN cord-260054-iihgc5nr 22 8 ( ( -LRB- cord-260054-iihgc5nr 22 9 S s NN cord-260054-iihgc5nr 22 10 ) ) -RRB- cord-260054-iihgc5nr 22 11 glycoproteins glycoprotein NNS cord-260054-iihgc5nr 22 12 on on IN cord-260054-iihgc5nr 22 13 their -PRON- PRP$ cord-260054-iihgc5nr 22 14 envelope envelope NN cord-260054-iihgc5nr 22 15 , , , cord-260054-iihgc5nr 22 16 giving give VBG cord-260054-iihgc5nr 22 17 a a DT cord-260054-iihgc5nr 22 18 crown crown NN cord-260054-iihgc5nr 22 19 ( ( -LRB- cord-260054-iihgc5nr 22 20 corona corona NN cord-260054-iihgc5nr 22 21 in in IN cord-260054-iihgc5nr 22 22 latin latin NNP cord-260054-iihgc5nr 22 23 ) ) -RRB- cord-260054-iihgc5nr 22 24 shape shape NN cord-260054-iihgc5nr 22 25 to to IN cord-260054-iihgc5nr 22 26 the the DT cord-260054-iihgc5nr 22 27 virions virion NNS cord-260054-iihgc5nr 22 28 ( ( -LRB- cord-260054-iihgc5nr 22 29 4 4 CD cord-260054-iihgc5nr 22 30 ) ) -RRB- cord-260054-iihgc5nr 22 31 . . . 
cord-260054-iihgc5nr 23 1 Of of IN cord-260054-iihgc5nr 23 2 the the DT cord-260054-iihgc5nr 23 3 four four CD cord-260054-iihgc5nr 23 4 structural structural JJ cord-260054-iihgc5nr 23 5 proteins protein NNS cord-260054-iihgc5nr 23 6 of of IN cord-260054-iihgc5nr 23 7 coronavirues coronavirue NNS cord-260054-iihgc5nr 23 8 , , , cord-260054-iihgc5nr 23 9 S S NNP cord-260054-iihgc5nr 23 10 , , , cord-260054-iihgc5nr 23 11 envelope envelope NN cord-260054-iihgc5nr 23 12 ( ( -LRB- cord-260054-iihgc5nr 23 13 E E NNP cord-260054-iihgc5nr 23 14 ) ) -RRB- cord-260054-iihgc5nr 23 15 , , , cord-260054-iihgc5nr 23 16 membrane membrane NN cord-260054-iihgc5nr 23 17 ( ( -LRB- cord-260054-iihgc5nr 23 18 M M NNP cord-260054-iihgc5nr 23 19 ) ) -RRB- cord-260054-iihgc5nr 23 20 , , , cord-260054-iihgc5nr 23 21 and and CC cord-260054-iihgc5nr 23 22 nucleocapsid nucleocapsid NN cord-260054-iihgc5nr 23 23 ( ( -LRB- cord-260054-iihgc5nr 23 24 N N NNP cord-260054-iihgc5nr 23 25 ) ) -RRB- cord-260054-iihgc5nr 23 26 , , , cord-260054-iihgc5nr 23 27 the the DT cord-260054-iihgc5nr 23 28 S S NNP cord-260054-iihgc5nr 23 29 protein protein NN cord-260054-iihgc5nr 23 30 is be VBZ cord-260054-iihgc5nr 23 31 the the DT cord-260054-iihgc5nr 23 32 one one NN cord-260054-iihgc5nr 23 33 playing play VBG cord-260054-iihgc5nr 23 34 a a DT cord-260054-iihgc5nr 23 35 key key JJ cord-260054-iihgc5nr 23 36 role role NN cord-260054-iihgc5nr 23 37 in in IN cord-260054-iihgc5nr 23 38 mediating mediate VBG cord-260054-iihgc5nr 23 39 the the DT cord-260054-iihgc5nr 23 40 viral viral JJ cord-260054-iihgc5nr 23 41 entry entry NN cord-260054-iihgc5nr 23 42 into into IN cord-260054-iihgc5nr 23 43 the the DT cord-260054-iihgc5nr 23 44 host host NN cord-260054-iihgc5nr 23 45 cells cell NNS cord-260054-iihgc5nr 23 46 ( ( -LRB- cord-260054-iihgc5nr 23 47 5 5 CD cord-260054-iihgc5nr 23 48 ) ) -RRB- cord-260054-iihgc5nr 23 49 ( ( -LRB- cord-260054-iihgc5nr 23 50 6 6 LS cord-260054-iihgc5nr 23 51 ) ) -RRB- 
cord-260054-iihgc5nr 23 52 ( ( -LRB- cord-260054-iihgc5nr 23 53 7 7 CD cord-260054-iihgc5nr 23 54 ) ) -RRB- cord-260054-iihgc5nr 23 55 , , , cord-260054-iihgc5nr 23 56 making make VBG cord-260054-iihgc5nr 23 57 it -PRON- PRP cord-260054-iihgc5nr 23 58 one one CD cord-260054-iihgc5nr 23 59 of of IN cord-260054-iihgc5nr 23 60 the the DT cord-260054-iihgc5nr 23 61 main main JJ cord-260054-iihgc5nr 23 62 targets target NNS cord-260054-iihgc5nr 23 63 for for IN cord-260054-iihgc5nr 23 64 the the DT cord-260054-iihgc5nr 23 65 development development NN cord-260054-iihgc5nr 23 66 of of IN cord-260054-iihgc5nr 23 67 therapeutic therapeutic JJ cord-260054-iihgc5nr 23 68 drugs drug NNS cord-260054-iihgc5nr 23 69 and and CC cord-260054-iihgc5nr 23 70 vaccines vaccine NNS cord-260054-iihgc5nr 23 71 ( ( -LRB- cord-260054-iihgc5nr 23 72 8) 8) NNP cord-260054-iihgc5nr 23 73 ( ( -LRB- cord-260054-iihgc5nr 23 74 9 9 CD cord-260054-iihgc5nr 23 75 ) ) -RRB- cord-260054-iihgc5nr 23 76 ( ( -LRB- cord-260054-iihgc5nr 23 77 10 10 CD cord-260054-iihgc5nr 23 78 ) ) -RRB- cord-260054-iihgc5nr 23 79 ( ( -LRB- cord-260054-iihgc5nr 23 80 11 11 CD cord-260054-iihgc5nr 23 81 ) ) -RRB- cord-260054-iihgc5nr 23 82 ( ( -LRB- cord-260054-iihgc5nr 23 83 12 12 CD cord-260054-iihgc5nr 23 84 ) ) -RRB- cord-260054-iihgc5nr 23 85 ( ( -LRB- cord-260054-iihgc5nr 23 86 13 13 CD cord-260054-iihgc5nr 23 87 ) ) -RRB- cord-260054-iihgc5nr 23 88 ( ( -LRB- cord-260054-iihgc5nr 23 89 14 14 CD cord-260054-iihgc5nr 23 90 ) ) -RRB- cord-260054-iihgc5nr 23 91 . . . 
cord-260054-iihgc5nr 24 1 Comprised comprise VBN cord-260054-iihgc5nr 24 2 of of IN cord-260054-iihgc5nr 24 3 two two CD cord-260054-iihgc5nr 24 4 functional functional JJ cord-260054-iihgc5nr 24 5 subunits subunit NNS cord-260054-iihgc5nr 24 6 , , , cord-260054-iihgc5nr 24 7 S1 S1 NNP cord-260054-iihgc5nr 24 8 and and CC cord-260054-iihgc5nr 24 9 S2 S2 NNP cord-260054-iihgc5nr 24 10 , , , cord-260054-iihgc5nr 24 11 it -PRON- PRP cord-260054-iihgc5nr 24 12 first first RB cord-260054-iihgc5nr 24 13 binds bind VBZ cord-260054-iihgc5nr 24 14 to to IN cord-260054-iihgc5nr 24 15 a a DT cord-260054-iihgc5nr 24 16 host host NN cord-260054-iihgc5nr 24 17 receptor receptor NN cord-260054-iihgc5nr 24 18 through through IN cord-260054-iihgc5nr 24 19 the the DT cord-260054-iihgc5nr 24 20 receptor receptor NN cord-260054-iihgc5nr 24 21 - - HYPH cord-260054-iihgc5nr 24 22 binding bind VBG cord-260054-iihgc5nr 24 23 domain domain NN cord-260054-iihgc5nr 24 24 ( ( -LRB- cord-260054-iihgc5nr 24 25 RBD RBD NNP cord-260054-iihgc5nr 24 26 ) ) -RRB- cord-260054-iihgc5nr 24 27 in in IN cord-260054-iihgc5nr 24 28 the the DT cord-260054-iihgc5nr 24 29 S1 S1 NNP cord-260054-iihgc5nr 24 30 subunit subunit NN cord-260054-iihgc5nr 24 31 and and CC cord-260054-iihgc5nr 24 32 then then RB cord-260054-iihgc5nr 24 33 fuses fuse VBZ cord-260054-iihgc5nr 24 34 the the DT cord-260054-iihgc5nr 24 35 viral viral JJ cord-260054-iihgc5nr 24 36 and and CC cord-260054-iihgc5nr 24 37 host host NN cord-260054-iihgc5nr 24 38 membranes membrane NNS cord-260054-iihgc5nr 24 39 through through IN cord-260054-iihgc5nr 24 40 the the DT cord-260054-iihgc5nr 24 41 S2 S2 NNP cord-260054-iihgc5nr 24 42 subunit subunit NN cord-260054-iihgc5nr 24 43 ( ( -LRB- cord-260054-iihgc5nr 24 44 7 7 CD cord-260054-iihgc5nr 24 45 , , , cord-260054-iihgc5nr 24 46 15 15 CD cord-260054-iihgc5nr 24 47 ) ) -RRB- cord-260054-iihgc5nr 24 48 . . . 
cord-260054-iihgc5nr 25 1 In in IN cord-260054-iihgc5nr 25 2 the the DT cord-260054-iihgc5nr 25 3 prefusion prefusion NN cord-260054-iihgc5nr 25 4 conformation conformation NN cord-260054-iihgc5nr 25 5 , , , cord-260054-iihgc5nr 25 6 the the DT cord-260054-iihgc5nr 25 7 SARS SARS NNP cord-260054-iihgc5nr 25 8 - - HYPH cord-260054-iihgc5nr 25 9 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 25 10 S S NNP cord-260054-iihgc5nr 25 11 protein protein NN cord-260054-iihgc5nr 25 12 forms form VBZ cord-260054-iihgc5nr 25 13 homotrimers homotrimer NNS cord-260054-iihgc5nr 25 14 protruding protrude VBG cord-260054-iihgc5nr 25 15 from from IN cord-260054-iihgc5nr 25 16 the the DT cord-260054-iihgc5nr 25 17 viral viral JJ cord-260054-iihgc5nr 25 18 surface surface NN cord-260054-iihgc5nr 25 19 , , , cord-260054-iihgc5nr 25 20 where where WRB cord-260054-iihgc5nr 25 21 its -PRON- PRP$ cord-260054-iihgc5nr 25 22 RBD RBD NNP cord-260054-iihgc5nr 25 23 binds bind VBZ cord-260054-iihgc5nr 25 24 to to IN cord-260054-iihgc5nr 25 25 the the DT cord-260054-iihgc5nr 25 26 angiotensin angiotensin NN cord-260054-iihgc5nr 25 27 - - HYPH cord-260054-iihgc5nr 25 28 converting convert VBG cord-260054-iihgc5nr 25 29 enzyme enzyme NN cord-260054-iihgc5nr 25 30 2 2 CD cord-260054-iihgc5nr 25 31 ( ( -LRB- cord-260054-iihgc5nr 25 32 ACE2 ACE2 NNP cord-260054-iihgc5nr 25 33 ) ) -RRB- cord-260054-iihgc5nr 25 34 receptor receptor NN cord-260054-iihgc5nr 25 35 on on IN cord-260054-iihgc5nr 25 36 the the DT cord-260054-iihgc5nr 25 37 host host NN cord-260054-iihgc5nr 25 38 cell cell NN cord-260054-iihgc5nr 25 39 surface surface NN cord-260054-iihgc5nr 25 40 ( ( -LRB- cord-260054-iihgc5nr 25 41 1 1 CD cord-260054-iihgc5nr 25 42 ) ) -RRB- cord-260054-iihgc5nr 26 1 ( ( -LRB- cord-260054-iihgc5nr 26 2 like like IN cord-260054-iihgc5nr 26 3 the the DT cord-260054-iihgc5nr 26 4 SARS SARS NNP cord-260054-iihgc5nr 26 5 - - HYPH cord-260054-iihgc5nr 26 6 CoV CoV NNP cord-260054-iihgc5nr 26 7 homolog homolog NN 
cord-260054-iihgc5nr 26 8 ( ( -LRB- cord-260054-iihgc5nr 26 9 16 16 CD cord-260054-iihgc5nr 26 10 ) ) -RRB- cord-260054-iihgc5nr 26 11 , , , cord-260054-iihgc5nr 26 12 and and CC cord-260054-iihgc5nr 26 13 differently differently RB cord-260054-iihgc5nr 26 14 from from IN cord-260054-iihgc5nr 26 15 MERS MERS NNP cord-260054-iihgc5nr 26 16 - - HYPH cord-260054-iihgc5nr 26 17 CoV CoV NNP cord-260054-iihgc5nr 26 18 S S NNP cord-260054-iihgc5nr 26 19 , , , cord-260054-iihgc5nr 26 20 which which WDT cord-260054-iihgc5nr 26 21 recognizes recognize VBZ cord-260054-iihgc5nr 26 22 a a DT cord-260054-iihgc5nr 26 23 different different JJ cord-260054-iihgc5nr 26 24 receptor receptor NN cord-260054-iihgc5nr 26 25 , , , cord-260054-iihgc5nr 26 26 the the DT cord-260054-iihgc5nr 26 27 dipeptidyl dipeptidyl JJ cord-260054-iihgc5nr 26 28 peptidase peptidase NN cord-260054-iihgc5nr 26 29 4 4 CD cord-260054-iihgc5nr 26 30 ( ( -LRB- cord-260054-iihgc5nr 26 31 17 17 CD cord-260054-iihgc5nr 26 32 ) ) -RRB- cord-260054-iihgc5nr 26 33 ) ) -RRB- cord-260054-iihgc5nr 26 34 . . . 
cord-260054-iihgc5nr 27 1 Receptor receptor NN cord-260054-iihgc5nr 27 2 binding binding NN cord-260054-iihgc5nr 27 3 and and CC cord-260054-iihgc5nr 27 4 proteolytic proteolytic JJ cord-260054-iihgc5nr 27 5 processing processing NN cord-260054-iihgc5nr 27 6 by by IN cord-260054-iihgc5nr 27 7 cellular cellular JJ cord-260054-iihgc5nr 27 8 proteases protease NNS cord-260054-iihgc5nr 27 9 then then RB cord-260054-iihgc5nr 27 10 cause cause VBP cord-260054-iihgc5nr 27 11 S1 s1 NN cord-260054-iihgc5nr 27 12 to to TO cord-260054-iihgc5nr 27 13 dissociate dissociate VB cord-260054-iihgc5nr 27 14 and and CC cord-260054-iihgc5nr 27 15 S2 S2 NNP cord-260054-iihgc5nr 27 16 to to TO cord-260054-iihgc5nr 27 17 undergo undergo VB cord-260054-iihgc5nr 27 18 large large JJ cord-260054-iihgc5nr 27 19 - - HYPH cord-260054-iihgc5nr 27 20 scale scale NN cord-260054-iihgc5nr 27 21 conformational conformational JJ cord-260054-iihgc5nr 27 22 changes change NNS cord-260054-iihgc5nr 27 23 towards towards IN cord-260054-iihgc5nr 27 24 a a DT cord-260054-iihgc5nr 27 25 stable stable JJ cord-260054-iihgc5nr 27 26 structure structure NN cord-260054-iihgc5nr 27 27 , , , cord-260054-iihgc5nr 27 28 bringing bring VBG cord-260054-iihgc5nr 27 29 viral viral JJ cord-260054-iihgc5nr 27 30 and and CC cord-260054-iihgc5nr 27 31 cellular cellular JJ cord-260054-iihgc5nr 27 32 membranes membrane NNS cord-260054-iihgc5nr 27 33 into into IN cord-260054-iihgc5nr 27 34 close close JJ cord-260054-iihgc5nr 27 35 proximity proximity NN cord-260054-iihgc5nr 27 36 for for IN cord-260054-iihgc5nr 27 37 fusion fusion NN cord-260054-iihgc5nr 27 38 and and CC cord-260054-iihgc5nr 27 39 infection infection NN cord-260054-iihgc5nr 27 40 ( ( -LRB- cord-260054-iihgc5nr 27 41 7 7 CD cord-260054-iihgc5nr 27 42 , , , cord-260054-iihgc5nr 27 43 15 15 CD cord-260054-iihgc5nr 27 44 , , , cord-260054-iihgc5nr 27 45 18 18 CD cord-260054-iihgc5nr 27 46 ) ) -RRB- cord-260054-iihgc5nr 27 47 . . . 
cord-260054-iihgc5nr 28 1 While while IN cord-260054-iihgc5nr 28 2 the the DT cord-260054-iihgc5nr 28 3 outbreak outbreak NN cord-260054-iihgc5nr 28 4 of of IN cord-260054-iihgc5nr 28 5 COVID-19 COVID-19 NNP cord-260054-iihgc5nr 28 6 was be VBD cord-260054-iihgc5nr 28 7 rapidly rapidly RB cord-260054-iihgc5nr 28 8 spreading spread VBG cord-260054-iihgc5nr 28 9 all all RB cord-260054-iihgc5nr 28 10 over over IN cord-260054-iihgc5nr 28 11 the the DT cord-260054-iihgc5nr 28 12 world world NN cord-260054-iihgc5nr 28 13 , , , cord-260054-iihgc5nr 28 14 affecting affect VBG cord-260054-iihgc5nr 28 15 millions million NNS cord-260054-iihgc5nr 28 16 of of IN cord-260054-iihgc5nr 28 17 people people NNS cord-260054-iihgc5nr 28 18 and and CC cord-260054-iihgc5nr 28 19 becoming become VBG cord-260054-iihgc5nr 28 20 a a DT cord-260054-iihgc5nr 28 21 global global JJ cord-260054-iihgc5nr 28 22 threat threat NN cord-260054-iihgc5nr 28 23 , , , cord-260054-iihgc5nr 28 24 laboratories laboratory NNS cord-260054-iihgc5nr 28 25 worldwide worldwide RB cord-260054-iihgc5nr 28 26 promptly promptly RB cord-260054-iihgc5nr 28 27 started start VBD cord-260054-iihgc5nr 28 28 to to TO cord-260054-iihgc5nr 28 29 sequence sequence VB cord-260054-iihgc5nr 28 30 a a DT cord-260054-iihgc5nr 28 31 large large JJ cord-260054-iihgc5nr 28 32 number number NN cord-260054-iihgc5nr 28 33 of of IN cord-260054-iihgc5nr 28 34 SARS SARS NNP cord-260054-iihgc5nr 28 35 - - HYPH cord-260054-iihgc5nr 28 36 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 28 37 genomes genome NNS cord-260054-iihgc5nr 28 38 . . . 
cord-260054-iihgc5nr 29 1 All all PDT cord-260054-iihgc5nr 29 2 the the DT cord-260054-iihgc5nr 29 3 available available JJ cord-260054-iihgc5nr 29 4 genomic genomic JJ cord-260054-iihgc5nr 29 5 data datum NNS cord-260054-iihgc5nr 29 6 is be VBZ cord-260054-iihgc5nr 29 7 accessible accessible JJ cord-260054-iihgc5nr 29 8 through through IN cord-260054-iihgc5nr 29 9 the the DT cord-260054-iihgc5nr 29 10 Global Global NNP cord-260054-iihgc5nr 29 11 Initiative Initiative NNP cord-260054-iihgc5nr 29 12 on on IN cord-260054-iihgc5nr 29 13 Sharing Sharing NNP cord-260054-iihgc5nr 29 14 All all DT cord-260054-iihgc5nr 29 15 Influenza Influenza NNP cord-260054-iihgc5nr 29 16 Data Data NNP cord-260054-iihgc5nr 29 17 ( ( -LRB- cord-260054-iihgc5nr 29 18 GISAID GISAID NNP cord-260054-iihgc5nr 29 19 ) ) -RRB- cord-260054-iihgc5nr 29 20 website website NN cord-260054-iihgc5nr 29 21 , , , cord-260054-iihgc5nr 29 22 an an DT cord-260054-iihgc5nr 29 23 invaluable invaluable JJ cord-260054-iihgc5nr 29 24 open open JJ cord-260054-iihgc5nr 29 25 access access NN cord-260054-iihgc5nr 29 26 resource resource NN cord-260054-iihgc5nr 29 27 ( ( -LRB- cord-260054-iihgc5nr 29 28 19 19 CD cord-260054-iihgc5nr 29 29 , , , cord-260054-iihgc5nr 29 30 20 20 CD cord-260054-iihgc5nr 29 31 ) ) -RRB- cord-260054-iihgc5nr 29 32 . . . 
cord-260054-iihgc5nr 30 1 Simultaneously simultaneously RB cord-260054-iihgc5nr 30 2 , , , cord-260054-iihgc5nr 30 3 crucial crucial JJ cord-260054-iihgc5nr 30 4 structural structural JJ cord-260054-iihgc5nr 30 5 knowledge knowledge NN cord-260054-iihgc5nr 30 6 has have VBZ cord-260054-iihgc5nr 30 7 been be VBN cord-260054-iihgc5nr 30 8 achieved achieve VBN cord-260054-iihgc5nr 30 9 on on IN cord-260054-iihgc5nr 30 10 SARS SARS NNP cord-260054-iihgc5nr 30 11 - - HYPH cord-260054-iihgc5nr 30 12 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 30 13 , , , cord-260054-iihgc5nr 30 14 especially especially RB cord-260054-iihgc5nr 30 15 regarding regard VBG cord-260054-iihgc5nr 30 16 the the DT cord-260054-iihgc5nr 30 17 S S NNP cord-260054-iihgc5nr 30 18 protein protein NN cord-260054-iihgc5nr 30 19 . . . cord-260054-iihgc5nr 31 1 3D 3d JJ cord-260054-iihgc5nr 31 2 structures structure NNS cord-260054-iihgc5nr 31 3 are be VBP cord-260054-iihgc5nr 31 4 now now RB cord-260054-iihgc5nr 31 5 available available JJ cord-260054-iihgc5nr 31 6 from from IN cord-260054-iihgc5nr 31 7 the the DT cord-260054-iihgc5nr 31 8 Protein Protein NNP cord-260054-iihgc5nr 31 9 Data Data NNP cord-260054-iihgc5nr 31 10 Bank Bank NNP cord-260054-iihgc5nr 31 11 ( ( -LRB- cord-260054-iihgc5nr 31 12 PDB PDB NNP cord-260054-iihgc5nr 31 13 ) ) -RRB- cord-260054-iihgc5nr 31 14 ( ( -LRB- cord-260054-iihgc5nr 31 15 21 21 CD cord-260054-iihgc5nr 31 16 ) ) -RRB- cord-260054-iihgc5nr 31 17 for for IN cord-260054-iihgc5nr 31 18 the the DT cord-260054-iihgc5nr 31 19 SARS SARS NNP cord-260054-iihgc5nr 31 20 - - HYPH cord-260054-iihgc5nr 31 21 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 31 22 S S NNP cord-260054-iihgc5nr 31 23 protein protein NN cord-260054-iihgc5nr 31 24 in in IN cord-260054-iihgc5nr 31 25 the the DT cord-260054-iihgc5nr 31 26 pre pre JJ cord-260054-iihgc5nr 31 27 - - JJ cord-260054-iihgc5nr 31 28 fusion fusion JJ cord-260054-iihgc5nr 31 29 conformation conformation NN cord-260054-iihgc5nr 31 30 , , , 
cord-260054-iihgc5nr 31 31 also also RB cord-260054-iihgc5nr 31 32 bound bind VBN cord-260054-iihgc5nr 31 33 to to IN cord-260054-iihgc5nr 31 34 the the DT cord-260054-iihgc5nr 31 35 ACE2 ACE2 NNP cord-260054-iihgc5nr 31 36 receptor receptor NN cord-260054-iihgc5nr 31 37 ( ( -LRB- cord-260054-iihgc5nr 31 38 22 22 CD cord-260054-iihgc5nr 31 39 ) ) -RRB- cord-260054-iihgc5nr 31 40 ( ( -LRB- cord-260054-iihgc5nr 31 41 23 23 CD cord-260054-iihgc5nr 31 42 ) ) -RRB- cord-260054-iihgc5nr 31 43 ( ( -LRB- cord-260054-iihgc5nr 31 44 24 24 CD cord-260054-iihgc5nr 31 45 ) ) -RRB- cord-260054-iihgc5nr 31 46 ( ( -LRB- cord-260054-iihgc5nr 31 47 25 25 CD cord-260054-iihgc5nr 31 48 ) ) -RRB- cord-260054-iihgc5nr 31 49 ( ( -LRB- cord-260054-iihgc5nr 31 50 26 26 CD cord-260054-iihgc5nr 31 51 ) ) -RRB- cord-260054-iihgc5nr 31 52 ( ( -LRB- cord-260054-iihgc5nr 31 53 27 27 CD cord-260054-iihgc5nr 31 54 ) ) -RRB- cord-260054-iihgc5nr 31 55 ( ( -LRB- cord-260054-iihgc5nr 31 56 28 28 CD cord-260054-iihgc5nr 31 57 ) ) -RRB- cord-260054-iihgc5nr 31 58 , , , cord-260054-iihgc5nr 31 59 and and CC cord-260054-iihgc5nr 31 60 for for IN cord-260054-iihgc5nr 31 61 the the DT cord-260054-iihgc5nr 31 62 post post JJ cord-260054-iihgc5nr 31 63 - - JJ cord-260054-iihgc5nr 31 64 fusion fusion JJ cord-260054-iihgc5nr 31 65 core core NN cord-260054-iihgc5nr 31 66 of of IN cord-260054-iihgc5nr 31 67 its -PRON- PRP$ cord-260054-iihgc5nr 31 68 S2 S2 NNP cord-260054-iihgc5nr 31 69 subunit subunit NN cord-260054-iihgc5nr 31 70 in in IN cord-260054-iihgc5nr 31 71 the the DT cord-260054-iihgc5nr 31 72 postfusion postfusion NN cord-260054-iihgc5nr 31 73 conformation conformation NN cord-260054-iihgc5nr 31 74 ( ( -LRB- cord-260054-iihgc5nr 31 75 29 29 CD cord-260054-iihgc5nr 31 76 ) ) -RRB- cord-260054-iihgc5nr 31 77 . . . 
cord-260054-iihgc5nr 32 1 On on IN cord-260054-iihgc5nr 32 2 April April NNP cord-260054-iihgc5nr 32 3 21 21 CD cord-260054-iihgc5nr 32 4 st st NNP cord-260054-iihgc5nr 32 5 2020 2020 CD cord-260054-iihgc5nr 32 6 , , , cord-260054-iihgc5nr 32 7 4 4 CD cord-260054-iihgc5nr 32 8 months month NNS cord-260054-iihgc5nr 32 9 after after IN cord-260054-iihgc5nr 32 10 the the DT cord-260054-iihgc5nr 32 11 first first JJ cord-260054-iihgc5nr 32 12 sequencing sequencing NN cord-260054-iihgc5nr 32 13 ( ( -LRB- cord-260054-iihgc5nr 32 14 30 30 CD cord-260054-iihgc5nr 32 15 ) ) -RRB- cord-260054-iihgc5nr 32 16 , , , cord-260054-iihgc5nr 32 17 10,823 10,823 CD cord-260054-iihgc5nr 32 18 genomic genomic JJ cord-260054-iihgc5nr 32 19 sequences sequence NNS cord-260054-iihgc5nr 32 20 of of IN cord-260054-iihgc5nr 32 21 SARS SARS NNP cord-260054-iihgc5nr 32 22 - - HYPH cord-260054-iihgc5nr 32 23 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 32 24 were be VBD cord-260054-iihgc5nr 32 25 available available JJ cord-260054-iihgc5nr 32 26 from from IN cord-260054-iihgc5nr 32 27 GISAID GISAID NNP cord-260054-iihgc5nr 32 28 . . . 
cord-260054-iihgc5nr 33 1 Therefore therefore RB cord-260054-iihgc5nr 33 2 , , , cord-260054-iihgc5nr 33 3 we -PRON- PRP cord-260054-iihgc5nr 33 4 considered consider VBD cord-260054-iihgc5nr 33 5 the the DT cord-260054-iihgc5nr 33 6 time time NN cord-260054-iihgc5nr 33 7 ripe ripe JJ cord-260054-iihgc5nr 33 8 for for IN cord-260054-iihgc5nr 33 9 an an DT cord-260054-iihgc5nr 33 10 assessment assessment NN cord-260054-iihgc5nr 33 11 of of IN cord-260054-iihgc5nr 33 12 the the DT cord-260054-iihgc5nr 33 13 mutational mutational JJ cord-260054-iihgc5nr 33 14 spectrum spectrum NN cord-260054-iihgc5nr 33 15 of of IN cord-260054-iihgc5nr 33 16 the the DT cord-260054-iihgc5nr 33 17 SARS SARS NNP cord-260054-iihgc5nr 33 18 - - HYPH cord-260054-iihgc5nr 33 19 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 33 20 spike spike NN cord-260054-iihgc5nr 33 21 protein protein NN cord-260054-iihgc5nr 33 22 . . . cord-260054-iihgc5nr 34 1 To to IN cord-260054-iihgc5nr 34 2 this this DT cord-260054-iihgc5nr 34 3 aim aim NN cord-260054-iihgc5nr 34 4 , , , cord-260054-iihgc5nr 34 5 we -PRON- PRP cord-260054-iihgc5nr 34 6 extracted extract VBD cord-260054-iihgc5nr 34 7 all all PDT cord-260054-iihgc5nr 34 8 the the DT cord-260054-iihgc5nr 34 9 complete complete JJ cord-260054-iihgc5nr 34 10 S S NNP cord-260054-iihgc5nr 34 11 protein protein NN cord-260054-iihgc5nr 34 12 sequences sequence NNS cord-260054-iihgc5nr 34 13 from from IN cord-260054-iihgc5nr 34 14 the the DT cord-260054-iihgc5nr 34 15 GISAID GISAID NNP cord-260054-iihgc5nr 34 16 21 21 CD cord-260054-iihgc5nr 34 17 st st NNP cord-260054-iihgc5nr 34 18 April April NNP cord-260054-iihgc5nr 34 19 dataset dataset NN cord-260054-iihgc5nr 34 20 and and CC cord-260054-iihgc5nr 34 21 identified identify VBD cord-260054-iihgc5nr 34 22 all all PDT cord-260054-iihgc5nr 34 23 the the DT cord-260054-iihgc5nr 34 24 mutations mutation NNS cord-260054-iihgc5nr 34 25 occurring occur VBG cord-260054-iihgc5nr 34 26 in in IN cord-260054-iihgc5nr 34 27 at at 
RB cord-260054-iihgc5nr 34 28 least least JJS cord-260054-iihgc5nr 34 29 2 2 CD cord-260054-iihgc5nr 34 30 identical identical JJ cord-260054-iihgc5nr 34 31 sequences sequence NNS cord-260054-iihgc5nr 34 32 ( ( -LRB- cord-260054-iihgc5nr 34 33 see see VB cord-260054-iihgc5nr 34 34 Table Table NNP cord-260054-iihgc5nr 34 35 S1 S1 NNP cord-260054-iihgc5nr 34 36 ) ) -RRB- cord-260054-iihgc5nr 34 37 . . . cord-260054-iihgc5nr 35 1 From from IN cord-260054-iihgc5nr 35 2 this this DT cord-260054-iihgc5nr 35 3 analysis analysis NN cord-260054-iihgc5nr 35 4 , , , cord-260054-iihgc5nr 35 5 a a DT cord-260054-iihgc5nr 35 6 14-amino 14-amino CD cord-260054-iihgc5nr 35 7 acid acid NN cord-260054-iihgc5nr 35 8 segment segment NN cord-260054-iihgc5nr 35 9 in in IN cord-260054-iihgc5nr 35 10 the the DT cord-260054-iihgc5nr 35 11 fusion fusion NN cord-260054-iihgc5nr 35 12 core core NN cord-260054-iihgc5nr 35 13 of of IN cord-260054-iihgc5nr 35 14 the the DT cord-260054-iihgc5nr 35 15 heptad heptad NN cord-260054-iihgc5nr 35 16 repeat repeat NN cord-260054-iihgc5nr 35 17 1 1 CD cord-260054-iihgc5nr 35 18 ( ( -LRB- cord-260054-iihgc5nr 35 19 HR1 HR1 NNP cord-260054-iihgc5nr 35 20 ) ) -RRB- cord-260054-iihgc5nr 35 21 emerged emerge VBD cord-260054-iihgc5nr 35 22 as as IN cord-260054-iihgc5nr 35 23 a a DT cord-260054-iihgc5nr 35 24 hotspot hotspot NN cord-260054-iihgc5nr 35 25 for for IN cord-260054-iihgc5nr 35 26 mutations mutation NNS cord-260054-iihgc5nr 35 27 . . . 
cord-260054-iihgc5nr 36 1 While while IN cord-260054-iihgc5nr 36 2 the the DT cord-260054-iihgc5nr 36 3 mutations mutation NNS cord-260054-iihgc5nr 36 4 we -PRON- PRP cord-260054-iihgc5nr 36 5 identified identify VBD cord-260054-iihgc5nr 36 6 corresponded correspond VBD cord-260054-iihgc5nr 36 7 to to IN cord-260054-iihgc5nr 36 8 a a DT cord-260054-iihgc5nr 36 9 1 1 CD cord-260054-iihgc5nr 36 10 mutation mutation NN cord-260054-iihgc5nr 36 11 every every DT cord-260054-iihgc5nr 36 12 12 12 CD cord-260054-iihgc5nr 36 13 positions position NNS cord-260054-iihgc5nr 36 14 along along IN cord-260054-iihgc5nr 36 15 the the DT cord-260054-iihgc5nr 36 16 protein protein NN cord-260054-iihgc5nr 36 17 sequence sequence NN cord-260054-iihgc5nr 36 18 , , , cord-260054-iihgc5nr 36 19 as as RB cord-260054-iihgc5nr 36 20 many many JJ cord-260054-iihgc5nr 36 21 as as IN cord-260054-iihgc5nr 36 22 6 6 CD cord-260054-iihgc5nr 36 23 amino amino NN cord-260054-iihgc5nr 36 24 acids acid NNS cord-260054-iihgc5nr 36 25 were be VBD cord-260054-iihgc5nr 36 26 found find VBN cord-260054-iihgc5nr 36 27 to to TO cord-260054-iihgc5nr 36 28 be be VB cord-260054-iihgc5nr 36 29 mutated mutate VBN cord-260054-iihgc5nr 36 30 in in IN cord-260054-iihgc5nr 36 31 the the DT cord-260054-iihgc5nr 36 32 above above JJ cord-260054-iihgc5nr 36 33 14-amino 14-amino CD cord-260054-iihgc5nr 36 34 acid acid NN cord-260054-iihgc5nr 36 35 segment segment NN cord-260054-iihgc5nr 36 36 : : : cord-260054-iihgc5nr 36 37 S929 s929 JJ cord-260054-iihgc5nr 36 38 , , , cord-260054-iihgc5nr 36 39 D936 d936 XX cord-260054-iihgc5nr 36 40 , , , cord-260054-iihgc5nr 36 41 L938 L938 NNP cord-260054-iihgc5nr 36 42 , , , cord-260054-iihgc5nr 36 43 S939 S939 NNP cord-260054-iihgc5nr 36 44 , , , cord-260054-iihgc5nr 36 45 S940 S940 NNP cord-260054-iihgc5nr 36 46 and and CC cord-260054-iihgc5nr 36 47 S943 S943 NNP cord-260054-iihgc5nr 36 48 . . . 
cord-260054-iihgc5nr 37 1 After after IN cord-260054-iihgc5nr 37 2 the the DT cord-260054-iihgc5nr 37 3 proteolytic proteolytic JJ cord-260054-iihgc5nr 37 4 processing processing NN cord-260054-iihgc5nr 37 5 , , , cord-260054-iihgc5nr 37 6 in in IN cord-260054-iihgc5nr 37 7 the the DT cord-260054-iihgc5nr 37 8 post post JJ cord-260054-iihgc5nr 37 9 - - JJ cord-260054-iihgc5nr 37 10 fusion fusion JJ cord-260054-iihgc5nr 37 11 conformation conformation NN cord-260054-iihgc5nr 37 12 , , , cord-260054-iihgc5nr 37 13 the the DT cord-260054-iihgc5nr 37 14 S S NNP cord-260054-iihgc5nr 37 15 protein protein NN cord-260054-iihgc5nr 37 16 HR1 HR1 NNP cord-260054-iihgc5nr 37 17 and and CC cord-260054-iihgc5nr 37 18 HR2 HR2 NNP cord-260054-iihgc5nr 37 19 motifs motif NNS cord-260054-iihgc5nr 37 20 interact interact VBP cord-260054-iihgc5nr 37 21 with with IN cord-260054-iihgc5nr 37 22 each each DT cord-260054-iihgc5nr 37 23 other other JJ cord-260054-iihgc5nr 37 24 to to TO cord-260054-iihgc5nr 37 25 form form VB cord-260054-iihgc5nr 37 26 a a DT cord-260054-iihgc5nr 37 27 six six CD cord-260054-iihgc5nr 37 28 - - HYPH cord-260054-iihgc5nr 37 29 helix helix NN cord-260054-iihgc5nr 37 30 bundle bundle NN cord-260054-iihgc5nr 37 31 ( ( -LRB- cord-260054-iihgc5nr 37 32 6-HB 6-hb CD cord-260054-iihgc5nr 37 33 ) ) -RRB- cord-260054-iihgc5nr 37 34 , , , cord-260054-iihgc5nr 37 35 which which WDT cord-260054-iihgc5nr 37 36 promotes promote VBZ cord-260054-iihgc5nr 37 37 initiation initiation NN cord-260054-iihgc5nr 37 38 of of IN cord-260054-iihgc5nr 37 39 the the DT cord-260054-iihgc5nr 37 40 viral viral JJ cord-260054-iihgc5nr 37 41 and and CC cord-260054-iihgc5nr 37 42 cellular cellular JJ cord-260054-iihgc5nr 37 43 membranes membrane NNS cord-260054-iihgc5nr 37 44 fusion fusion NN cord-260054-iihgc5nr 37 45 . . . 
cord-260054-iihgc5nr 38 1 The the DT cord-260054-iihgc5nr 38 2 HR1 HR1 NNP cord-260054-iihgc5nr 38 3 " " `` cord-260054-iihgc5nr 38 4 fusion fusion NN cord-260054-iihgc5nr 38 5 core core NN cord-260054-iihgc5nr 38 6 " " '' cord-260054-iihgc5nr 38 7 is be VBZ cord-260054-iihgc5nr 38 8 named name VBN cord-260054-iihgc5nr 38 9 after after IN cord-260054-iihgc5nr 38 10 its -PRON- PRP$ cord-260054-iihgc5nr 38 11 role role NN cord-260054-iihgc5nr 38 12 in in IN cord-260054-iihgc5nr 38 13 giving give VBG cord-260054-iihgc5nr 38 14 many many JJ cord-260054-iihgc5nr 38 15 interactions interaction NNS cord-260054-iihgc5nr 38 16 with with IN cord-260054-iihgc5nr 38 17 HR2 HR2 NNP cord-260054-iihgc5nr 38 18 in in IN cord-260054-iihgc5nr 38 19 the the DT cord-260054-iihgc5nr 38 20 post post JJ cord-260054-iihgc5nr 38 21 - - JJ cord-260054-iihgc5nr 38 22 fusion fusion JJ cord-260054-iihgc5nr 38 23 conformation conformation NN cord-260054-iihgc5nr 38 24 , , , cord-260054-iihgc5nr 38 25 thus thus RB cord-260054-iihgc5nr 38 26 playing play VBG cord-260054-iihgc5nr 38 27 a a DT cord-260054-iihgc5nr 38 28 key key JJ cord-260054-iihgc5nr 38 29 role role NN cord-260054-iihgc5nr 38 30 in in IN cord-260054-iihgc5nr 38 31 the the DT cord-260054-iihgc5nr 38 32 virus virus NN cord-260054-iihgc5nr 38 33 infectivity infectivity NN cord-260054-iihgc5nr 38 34 ( ( -LRB- cord-260054-iihgc5nr 38 35 31 31 CD cord-260054-iihgc5nr 38 36 ) ) -RRB- cord-260054-iihgc5nr 38 37 . . . 
cord-260054-iihgc5nr 39 1 Based base VBN cord-260054-iihgc5nr 39 2 on on IN cord-260054-iihgc5nr 39 3 the the DT cord-260054-iihgc5nr 39 4 structural structural JJ cord-260054-iihgc5nr 39 5 location location NN cord-260054-iihgc5nr 39 6 of of IN cord-260054-iihgc5nr 39 7 the the DT cord-260054-iihgc5nr 39 8 above above RB cord-260054-iihgc5nr 39 9 highly highly RB cord-260054-iihgc5nr 39 10 concentrated concentrated JJ cord-260054-iihgc5nr 39 11 mutations mutation NNS cord-260054-iihgc5nr 39 12 and and CC cord-260054-iihgc5nr 39 13 on on IN cord-260054-iihgc5nr 39 14 their -PRON- PRP$ cord-260054-iihgc5nr 39 15 nonconservative nonconservative JJ cord-260054-iihgc5nr 39 16 nature nature NN cord-260054-iihgc5nr 39 17 , , , cord-260054-iihgc5nr 39 18 we -PRON- PRP cord-260054-iihgc5nr 39 19 considered consider VBD cord-260054-iihgc5nr 39 20 them -PRON- PRP cord-260054-iihgc5nr 39 21 of of IN cord-260054-iihgc5nr 39 22 particular particular JJ cord-260054-iihgc5nr 39 23 interest interest NN cord-260054-iihgc5nr 39 24 and and CC cord-260054-iihgc5nr 39 25 decided decide VBD cord-260054-iihgc5nr 39 26 to to TO cord-260054-iihgc5nr 39 27 further further RB cord-260054-iihgc5nr 39 28 investigate investigate VB cord-260054-iihgc5nr 39 29 their -PRON- PRP$ cord-260054-iihgc5nr 39 30 structural structural JJ cord-260054-iihgc5nr 39 31 basis basis NN cord-260054-iihgc5nr 39 32 , , , cord-260054-iihgc5nr 39 33 both both CC cord-260054-iihgc5nr 39 34 in in IN cord-260054-iihgc5nr 39 35 the the DT cord-260054-iihgc5nr 39 36 pre pre NN cord-260054-iihgc5nr 39 37 - - JJ cord-260054-iihgc5nr 39 38 and and CC cord-260054-iihgc5nr 39 39 post post JJ cord-260054-iihgc5nr 39 40 - - JJ cord-260054-iihgc5nr 39 41 fusion fusion JJ cord-260054-iihgc5nr 39 42 conformation conformation NN cord-260054-iihgc5nr 39 43 , , , cord-260054-iihgc5nr 39 44 as as RB cord-260054-iihgc5nr 39 45 well well RB cord-260054-iihgc5nr 39 46 as as IN cord-260054-iihgc5nr 39 47 their -PRON- PRP$ 
cord-260054-iihgc5nr 39 48 sequencing sequencing NN cord-260054-iihgc5nr 39 49 dates date NNS cord-260054-iihgc5nr 39 50 and and CC cord-260054-iihgc5nr 39 51 geographical geographical JJ cord-260054-iihgc5nr 39 52 distribution distribution NN cord-260054-iihgc5nr 39 53 . . . cord-260054-iihgc5nr 40 1 As as IN cord-260054-iihgc5nr 40 2 we -PRON- PRP cord-260054-iihgc5nr 40 3 show show VBP cord-260054-iihgc5nr 40 4 in in IN cord-260054-iihgc5nr 40 5 the the DT cord-260054-iihgc5nr 40 6 following following NN cord-260054-iihgc5nr 40 7 , , , cord-260054-iihgc5nr 40 8 as as RB cord-260054-iihgc5nr 40 9 many many JJ cord-260054-iihgc5nr 40 10 as as IN cord-260054-iihgc5nr 40 11 three three CD cord-260054-iihgc5nr 40 12 of of IN cord-260054-iihgc5nr 40 13 them -PRON- PRP cord-260054-iihgc5nr 40 14 are be VBP cord-260054-iihgc5nr 40 15 responsible responsible JJ cord-260054-iihgc5nr 40 16 for for IN cord-260054-iihgc5nr 40 17 the the DT cord-260054-iihgc5nr 40 18 loss loss NN cord-260054-iihgc5nr 40 19 of of IN cord-260054-iihgc5nr 40 20 inter inter JJ cord-260054-iihgc5nr 40 21 - - JJ cord-260054-iihgc5nr 40 22 monomer monomer JJ cord-260054-iihgc5nr 40 23 Hbonds Hbonds NNP cord-260054-iihgc5nr 40 24 in in IN cord-260054-iihgc5nr 40 25 the the DT cord-260054-iihgc5nr 40 26 post post JJ cord-260054-iihgc5nr 40 27 - - JJ cord-260054-iihgc5nr 40 28 fusion fusion JJ cord-260054-iihgc5nr 40 29 conformation conformation NN cord-260054-iihgc5nr 40 30 , , , cord-260054-iihgc5nr 40 31 while while IN cord-260054-iihgc5nr 40 32 one one CD cord-260054-iihgc5nr 40 33 of of IN cord-260054-iihgc5nr 40 34 them -PRON- PRP cord-260054-iihgc5nr 40 35 , , , cord-260054-iihgc5nr 40 36 S943P S943P NNP cord-260054-iihgc5nr 40 37 , , , cord-260054-iihgc5nr 40 38 would would MD cord-260054-iihgc5nr 40 39 introduce introduce VB cord-260054-iihgc5nr 40 40 unexpected unexpected JJ cord-260054-iihgc5nr 40 41 structural structural JJ cord-260054-iihgc5nr 40 42 strain strain NN cord-260054-iihgc5nr 40 
43 in in IN cord-260054-iihgc5nr 40 44 the the DT cord-260054-iihgc5nr 40 45 pre pre JJ cord-260054-iihgc5nr 40 46 - - JJ cord-260054-iihgc5nr 40 47 fusion fusion JJ cord-260054-iihgc5nr 40 48 conformation conformation NN cord-260054-iihgc5nr 40 49 . . . cord-260054-iihgc5nr 41 1 A a DT cord-260054-iihgc5nr 41 2 search search NN cord-260054-iihgc5nr 41 3 in in IN cord-260054-iihgc5nr 41 4 the the DT cord-260054-iihgc5nr 41 5 GISAID GISAID NNP cord-260054-iihgc5nr 41 6 resource resource NN cord-260054-iihgc5nr 41 7 updated update VBN cord-260054-iihgc5nr 41 8 to to IN cord-260054-iihgc5nr 41 9 May May NNP cord-260054-iihgc5nr 41 10 29 29 CD cord-260054-iihgc5nr 41 11 th th XX cord-260054-iihgc5nr 41 12 showed show VBD cord-260054-iihgc5nr 41 13 a a DT cord-260054-iihgc5nr 41 14 significant significant JJ cord-260054-iihgc5nr 41 15 increase increase NN cord-260054-iihgc5nr 41 16 in in IN cord-260054-iihgc5nr 41 17 occurrences occurrence NNS cord-260054-iihgc5nr 41 18 especially especially RB cord-260054-iihgc5nr 41 19 for for IN cord-260054-iihgc5nr 41 20 one one CD cord-260054-iihgc5nr 41 21 mutant mutant NN cord-260054-iihgc5nr 41 22 , , , cord-260054-iihgc5nr 41 23 D936Y D936Y NNP cord-260054-iihgc5nr 41 24 , , , cord-260054-iihgc5nr 41 25 unreported unreported JJ cord-260054-iihgc5nr 41 26 to to IN cord-260054-iihgc5nr 41 27 date date NN cord-260054-iihgc5nr 41 28 , , , cord-260054-iihgc5nr 41 29 which which WDT cord-260054-iihgc5nr 41 30 has have VBZ cord-260054-iihgc5nr 41 31 become become VBN cord-260054-iihgc5nr 41 32 a a DT cord-260054-iihgc5nr 41 33 common common JJ cord-260054-iihgc5nr 41 34 variant variant NN cord-260054-iihgc5nr 41 35 in in IN cord-260054-iihgc5nr 41 36 some some DT cord-260054-iihgc5nr 41 37 European european JJ cord-260054-iihgc5nr 41 38 countries country NNS cord-260054-iihgc5nr 41 39 , , , cord-260054-iihgc5nr 41 40 especially especially RB cord-260054-iihgc5nr 41 41 Sweden Sweden NNP cord-260054-iihgc5nr 41 42 . . . 
cord-260054-iihgc5nr 42 1 It -PRON- PRP cord-260054-iihgc5nr 42 2 is be VBZ cord-260054-iihgc5nr 42 3 also also RB cord-260054-iihgc5nr 42 4 the the DT cord-260054-iihgc5nr 42 5 mutant mutant NN cord-260054-iihgc5nr 42 6 having have VBG cord-260054-iihgc5nr 42 7 the the DT cord-260054-iihgc5nr 42 8 most most RBS cord-260054-iihgc5nr 42 9 significant significant JJ cord-260054-iihgc5nr 42 10 structural structural JJ cord-260054-iihgc5nr 42 11 role role NN cord-260054-iihgc5nr 42 12 , , , cord-260054-iihgc5nr 42 13 causing cause VBG cord-260054-iihgc5nr 42 14 the the DT cord-260054-iihgc5nr 42 15 loss loss NN cord-260054-iihgc5nr 42 16 of of IN cord-260054-iihgc5nr 42 17 an an DT cord-260054-iihgc5nr 42 18 intermonomer intermonomer NNP cord-260054-iihgc5nr 42 19 salt salt NN cord-260054-iihgc5nr 42 20 bridge bridge NN cord-260054-iihgc5nr 42 21 in in IN cord-260054-iihgc5nr 42 22 the the DT cord-260054-iihgc5nr 42 23 post post JJ cord-260054-iihgc5nr 42 24 - - JJ cord-260054-iihgc5nr 42 25 fusion fusion JJ cord-260054-iihgc5nr 42 26 assembly assembly NN cord-260054-iihgc5nr 42 27 . . . cord-260054-iihgc5nr 43 1 We -PRON- PRP cord-260054-iihgc5nr 43 2 downloaded download VBD cord-260054-iihgc5nr 43 3 the the DT cord-260054-iihgc5nr 43 4 10,823 10,823 CD cord-260054-iihgc5nr 43 5 genomic genomic JJ cord-260054-iihgc5nr 43 6 sequences sequence NNS cord-260054-iihgc5nr 43 7 available available JJ cord-260054-iihgc5nr 43 8 from from IN cord-260054-iihgc5nr 43 9 GISAID GISAID NNP cord-260054-iihgc5nr 43 10 on on IN cord-260054-iihgc5nr 43 11 April April NNP cord-260054-iihgc5nr 43 12 21 21 CD cord-260054-iihgc5nr 44 1 st st NNP cord-260054-iihgc5nr 44 2 2020 2020 CD cord-260054-iihgc5nr 44 3 . . . 
cord-260054-iihgc5nr 45 1 From from IN cord-260054-iihgc5nr 45 2 these these DT cord-260054-iihgc5nr 45 3 sequences sequence NNS cord-260054-iihgc5nr 45 4 , , , cord-260054-iihgc5nr 45 5 we -PRON- PRP cord-260054-iihgc5nr 45 6 extracted extract VBD cord-260054-iihgc5nr 45 7 the the DT cord-260054-iihgc5nr 45 8 nucleotide nucleotide JJ cord-260054-iihgc5nr 45 9 sequences sequence NNS cord-260054-iihgc5nr 45 10 of of IN cord-260054-iihgc5nr 45 11 the the DT cord-260054-iihgc5nr 45 12 spike spike NN cord-260054-iihgc5nr 45 13 protein protein NN cord-260054-iihgc5nr 45 14 and and CC cord-260054-iihgc5nr 45 15 translated translate VBD cord-260054-iihgc5nr 45 16 them -PRON- PRP cord-260054-iihgc5nr 45 17 to to IN cord-260054-iihgc5nr 45 18 protein protein NN cord-260054-iihgc5nr 45 19 sequences sequence NNS cord-260054-iihgc5nr 45 20 with with IN cord-260054-iihgc5nr 45 21 in in IN cord-260054-iihgc5nr 45 22 - - HYPH cord-260054-iihgc5nr 45 23 house house NN cord-260054-iihgc5nr 45 24 scripts script NNS cord-260054-iihgc5nr 45 25 . . . 
cord-260054-iihgc5nr 46 1 Nucleotides nucleotide NNS cord-260054-iihgc5nr 46 2 sequences sequence NNS cord-260054-iihgc5nr 46 3 featuring feature VBG cord-260054-iihgc5nr 46 4 an an DT cord-260054-iihgc5nr 46 5 internal internal JJ cord-260054-iihgc5nr 46 6 stop stop NN cord-260054-iihgc5nr 46 7 codon codon NN cord-260054-iihgc5nr 46 8 , , , cord-260054-iihgc5nr 46 9 having have VBG cord-260054-iihgc5nr 46 10 at at RB cord-260054-iihgc5nr 46 11 least least RBS cord-260054-iihgc5nr 46 12 one one CD cord-260054-iihgc5nr 46 13 undefined undefined JJ cord-260054-iihgc5nr 46 14 ( ( -LRB- cord-260054-iihgc5nr 46 15 " " `` cord-260054-iihgc5nr 46 16 N N NNP cord-260054-iihgc5nr 46 17 " " '' cord-260054-iihgc5nr 46 18 ) ) -RRB- cord-260054-iihgc5nr 46 19 nucleotide nucleotide JJ cord-260054-iihgc5nr 46 20 or or CC cord-260054-iihgc5nr 46 21 resulting result VBG cord-260054-iihgc5nr 46 22 in in IN cord-260054-iihgc5nr 46 23 spike spike NN cord-260054-iihgc5nr 46 24 proteins protein NNS cord-260054-iihgc5nr 46 25 of of IN cord-260054-iihgc5nr 46 26 length length NN cord-260054-iihgc5nr 46 27 different different JJ cord-260054-iihgc5nr 46 28 from from IN cord-260054-iihgc5nr 46 29 1,273 1,273 CD cord-260054-iihgc5nr 46 30 amino amino JJ cord-260054-iihgc5nr 46 31 acids acid NNS cord-260054-iihgc5nr 46 32 were be VBD cord-260054-iihgc5nr 46 33 discarded discard VBN cord-260054-iihgc5nr 46 34 . . . cord-260054-iihgc5nr 47 1 Sequences sequence NNS cord-260054-iihgc5nr 47 2 annotated annotate VBN cord-260054-iihgc5nr 47 3 as as IN cord-260054-iihgc5nr 47 4 pangolin pangolin NN cord-260054-iihgc5nr 47 5 , , , cord-260054-iihgc5nr 47 6 bat bat NN cord-260054-iihgc5nr 47 7 or or CC cord-260054-iihgc5nr 47 8 canine canine NN cord-260054-iihgc5nr 47 9 were be VBD cord-260054-iihgc5nr 47 10 also also RB cord-260054-iihgc5nr 47 11 discarded discard VBN cord-260054-iihgc5nr 47 12 . . . 
cord-260054-iihgc5nr 48 1 The the DT cord-260054-iihgc5nr 48 2 remaining remain VBG cord-260054-iihgc5nr 48 3 7,692 7,692 CD cord-260054-iihgc5nr 48 4 protein protein NN cord-260054-iihgc5nr 48 5 sequences sequence NNS cord-260054-iihgc5nr 48 6 were be VBD cord-260054-iihgc5nr 48 7 further further RB cord-260054-iihgc5nr 48 8 analysed analyse VBN cord-260054-iihgc5nr 48 9 . . . cord-260054-iihgc5nr 49 1 First first RB cord-260054-iihgc5nr 49 2 , , , cord-260054-iihgc5nr 49 3 we -PRON- PRP cord-260054-iihgc5nr 49 4 clustered cluster VBD cord-260054-iihgc5nr 49 5 them -PRON- PRP cord-260054-iihgc5nr 49 6 in in IN cord-260054-iihgc5nr 49 7 sets set NNS cord-260054-iihgc5nr 49 8 of of IN cord-260054-iihgc5nr 49 9 identical identical JJ cord-260054-iihgc5nr 49 10 sequences sequence NNS cord-260054-iihgc5nr 49 11 with with IN cord-260054-iihgc5nr 49 12 CD CD NNP cord-260054-iihgc5nr 49 13 - - HYPH cord-260054-iihgc5nr 49 14 HIT HIT NNP cord-260054-iihgc5nr 49 15 ( ( -LRB- cord-260054-iihgc5nr 49 16 32 32 CD cord-260054-iihgc5nr 49 17 ) ) -RRB- cord-260054-iihgc5nr 49 18 , , , cord-260054-iihgc5nr 49 19 obtaining obtain VBG cord-260054-iihgc5nr 49 20 120 120 CD cord-260054-iihgc5nr 49 21 clusters cluster NNS cord-260054-iihgc5nr 49 22 of of IN cord-260054-iihgc5nr 49 23 at at RB cord-260054-iihgc5nr 49 24 least least JJS cord-260054-iihgc5nr 49 25 2 2 CD cord-260054-iihgc5nr 49 26 sequences sequence NNS cord-260054-iihgc5nr 49 27 and and CC cord-260054-iihgc5nr 49 28 245 245 CD cord-260054-iihgc5nr 49 29 unique unique JJ cord-260054-iihgc5nr 49 30 sequences sequence NNS cord-260054-iihgc5nr 49 31 . . . 
cord-260054-iihgc5nr 50 1 As as IN cord-260054-iihgc5nr 50 2 a a DT cord-260054-iihgc5nr 50 3 reference reference NN cord-260054-iihgc5nr 50 4 system system NN cord-260054-iihgc5nr 50 5 for for IN cord-260054-iihgc5nr 50 6 further further JJ cord-260054-iihgc5nr 50 7 analyses analysis NNS cord-260054-iihgc5nr 50 8 , , , cord-260054-iihgc5nr 50 9 we -PRON- PRP cord-260054-iihgc5nr 50 10 used use VBD cord-260054-iihgc5nr 50 11 the the DT cord-260054-iihgc5nr 50 12 first first JJ cord-260054-iihgc5nr 50 13 dated date VBN cord-260054-iihgc5nr 50 14 ( ( -LRB- cord-260054-iihgc5nr 50 15 on on IN cord-260054-iihgc5nr 50 16 December December NNP cord-260054-iihgc5nr 50 17 24 24 CD cord-260054-iihgc5nr 50 18 th th CD cord-260054-iihgc5nr 50 19 2019 2019 CD cord-260054-iihgc5nr 50 20 ) ) -RRB- cord-260054-iihgc5nr 51 1 genomic genomic JJ cord-260054-iihgc5nr 51 2 sequence sequence NN cord-260054-iihgc5nr 51 3 in in IN cord-260054-iihgc5nr 51 4 GISAID GISAID NNP cord-260054-iihgc5nr 51 5 , , , cord-260054-iihgc5nr 52 1 isolated isolate VBN cord-260054-iihgc5nr 52 2 and and CC cord-260054-iihgc5nr 52 3 sequenced sequence VBN cord-260054-iihgc5nr 52 4 in in IN cord-260054-iihgc5nr 52 5 Wuhan Wuhan NNP cord-260054-iihgc5nr 52 6 ( ( -LRB- cord-260054-iihgc5nr 52 7 Hubei Hubei NNP cord-260054-iihgc5nr 52 8 , , , cord-260054-iihgc5nr 52 9 China China NNP cord-260054-iihgc5nr 52 10 ) ) -RRB- cord-260054-iihgc5nr 52 11 ( ( -LRB- cord-260054-iihgc5nr 52 12 30 30 CD cord-260054-iihgc5nr 52 13 ) ) -RRB- cord-260054-iihgc5nr 52 14 . . . 
cord-260054-iihgc5nr 53 1 Then then RB cord-260054-iihgc5nr 53 2 , , , cord-260054-iihgc5nr 53 3 upon upon IN cord-260054-iihgc5nr 53 4 alignment alignment NN cord-260054-iihgc5nr 53 5 to to IN cord-260054-iihgc5nr 53 6 the the DT cord-260054-iihgc5nr 53 7 reference reference NN cord-260054-iihgc5nr 53 8 sequence sequence NN cord-260054-iihgc5nr 53 9 , , , cord-260054-iihgc5nr 53 10 we -PRON- PRP cord-260054-iihgc5nr 53 11 identified identify VBD cord-260054-iihgc5nr 53 12 point point NN cord-260054-iihgc5nr 53 13 mutations mutation NNS cord-260054-iihgc5nr 53 14 in in IN cord-260054-iihgc5nr 53 15 all all PDT cord-260054-iihgc5nr 53 16 the the DT cord-260054-iihgc5nr 53 17 sets set NNS cord-260054-iihgc5nr 53 18 of of IN cord-260054-iihgc5nr 53 19 at at RB cord-260054-iihgc5nr 53 20 least least RBS cord-260054-iihgc5nr 53 21 two two CD cord-260054-iihgc5nr 53 22 sequences sequence NNS cord-260054-iihgc5nr 53 23 . . . cord-260054-iihgc5nr 54 1 We -PRON- PRP cord-260054-iihgc5nr 54 2 downloaded download VBD cord-260054-iihgc5nr 54 3 again again RB cord-260054-iihgc5nr 54 4 the the DT cord-260054-iihgc5nr 54 5 34,805 34,805 CD cord-260054-iihgc5nr 54 6 genomic genomic JJ cord-260054-iihgc5nr 54 7 sequences sequence NNS cord-260054-iihgc5nr 54 8 available available JJ cord-260054-iihgc5nr 54 9 from from IN cord-260054-iihgc5nr 54 10 GISAID GISAID NNP cord-260054-iihgc5nr 54 11 on on IN cord-260054-iihgc5nr 54 12 May May NNP cord-260054-iihgc5nr 54 13 29 29 CD cord-260054-iihgc5nr 54 14 th th CD cord-260054-iihgc5nr 54 15 2020 2020 CD cord-260054-iihgc5nr 54 16 ( ( -LRB- cord-260054-iihgc5nr 54 17 gisaid_hcov-19_2020_05_29_14 gisaid_hcov-19_2020_05_29_14 NNP cord-260054-iihgc5nr 54 18 ) ) -RRB- cord-260054-iihgc5nr 54 19 and and CC cord-260054-iihgc5nr 54 20 followed follow VBD cord-260054-iihgc5nr 54 21 the the DT cord-260054-iihgc5nr 54 22 above above JJ cord-260054-iihgc5nr 54 23 pipeline pipeline NN cord-260054-iihgc5nr 54 24 to to TO cord-260054-iihgc5nr 54 25 
extract extract VB cord-260054-iihgc5nr 54 26 23,332 23,332 CD cord-260054-iihgc5nr 54 27 complete complete JJ cord-260054-iihgc5nr 54 28 1273-residue 1273-residue CD cord-260054-iihgc5nr 54 29 long long JJ cord-260054-iihgc5nr 54 30 S s NN cord-260054-iihgc5nr 54 31 protein protein NN cord-260054-iihgc5nr 54 32 sequences sequence NNS cord-260054-iihgc5nr 54 33 . . . cord-260054-iihgc5nr 55 1 We -PRON- PRP cord-260054-iihgc5nr 55 2 then then RB cord-260054-iihgc5nr 55 3 recorded record VBD cord-260054-iihgc5nr 55 4 the the DT cord-260054-iihgc5nr 55 5 presence presence NN cord-260054-iihgc5nr 55 6 and and CC cord-260054-iihgc5nr 55 7 frequency frequency NN cord-260054-iihgc5nr 55 8 in in IN cord-260054-iihgc5nr 55 9 them -PRON- PRP cord-260054-iihgc5nr 55 10 of of IN cord-260054-iihgc5nr 55 11 any any DT cord-260054-iihgc5nr 55 12 mutation mutation NN cord-260054-iihgc5nr 55 13 occurring occur VBG cord-260054-iihgc5nr 55 14 in in IN cord-260054-iihgc5nr 55 15 the the DT cord-260054-iihgc5nr 55 16 fusion fusion NN cord-260054-iihgc5nr 55 17 core core NN cord-260054-iihgc5nr 55 18 of of IN cord-260054-iihgc5nr 55 19 the the DT cord-260054-iihgc5nr 55 20 HR1 HR1 NNP cord-260054-iihgc5nr 55 21 ( ( -LRB- cord-260054-iihgc5nr 55 22 residues residue NNS cord-260054-iihgc5nr 55 23 929 929 CD cord-260054-iihgc5nr 55 24 - - SYM cord-260054-iihgc5nr 55 25 949 949 CD cord-260054-iihgc5nr 55 26 ) ) -RRB- cord-260054-iihgc5nr 55 27 with with IN cord-260054-iihgc5nr 55 28 in in IN cord-260054-iihgc5nr 55 29 - - HYPH cord-260054-iihgc5nr 55 30 house house NN cord-260054-iihgc5nr 55 31 scripts script NNS cord-260054-iihgc5nr 55 32 . . . 
cord-260054-iihgc5nr 56 1 Mutants mutant NNS cord-260054-iihgc5nr 56 2 3D 3d JJ cord-260054-iihgc5nr 56 3 models model NNS cord-260054-iihgc5nr 56 4 were be VBD cord-260054-iihgc5nr 56 5 built build VBN cord-260054-iihgc5nr 56 6 using use VBG cord-260054-iihgc5nr 56 7 the the DT cord-260054-iihgc5nr 56 8 mutate_model mutate_model NNP cord-260054-iihgc5nr 56 9 module module NN cord-260054-iihgc5nr 56 10 of of IN cord-260054-iihgc5nr 56 11 the the DT cord-260054-iihgc5nr 56 12 Modeller Modeller NNP cord-260054-iihgc5nr 56 13 9v11 9v11 CD cord-260054-iihgc5nr 56 14 program program NN cord-260054-iihgc5nr 56 15 ( ( -LRB- cord-260054-iihgc5nr 56 16 33 33 CD cord-260054-iihgc5nr 56 17 ) ) -RRB- cord-260054-iihgc5nr 56 18 . . . cord-260054-iihgc5nr 57 1 This this DT cord-260054-iihgc5nr 57 2 is be VBZ cord-260054-iihgc5nr 57 3 an an DT cord-260054-iihgc5nr 57 4 automated automated JJ cord-260054-iihgc5nr 57 5 method method NN cord-260054-iihgc5nr 57 6 for for IN cord-260054-iihgc5nr 57 7 modelling modelling NN cord-260054-iihgc5nr 57 8 point point NN cord-260054-iihgc5nr 57 9 mutations mutation NNS cord-260054-iihgc5nr 57 10 in in IN cord-260054-iihgc5nr 57 11 protein protein NN cord-260054-iihgc5nr 57 12 structures structure NNS cord-260054-iihgc5nr 57 13 , , , cord-260054-iihgc5nr 57 14 which which WDT cord-260054-iihgc5nr 57 15 includes include VBZ cord-260054-iihgc5nr 57 16 an an DT cord-260054-iihgc5nr 57 17 optimisation optimisation NN cord-260054-iihgc5nr 57 18 procedure procedure NN cord-260054-iihgc5nr 57 19 of of IN cord-260054-iihgc5nr 57 20 the the DT cord-260054-iihgc5nr 57 21 mutated mutated JJ cord-260054-iihgc5nr 57 22 residue residue NN cord-260054-iihgc5nr 57 23 in in IN cord-260054-iihgc5nr 57 24 its -PRON- PRP$ cord-260054-iihgc5nr 57 25 environment environment NN cord-260054-iihgc5nr 57 26 , , , cord-260054-iihgc5nr 57 27 beginning begin VBG cord-260054-iihgc5nr 57 28 with with IN cord-260054-iihgc5nr 57 29 a a DT cord-260054-iihgc5nr 57 30 conjugate 
conjugate NN cord-260054-iihgc5nr 57 31 gradients gradient NNS cord-260054-iihgc5nr 57 32 minimisation minimisation NN cord-260054-iihgc5nr 57 33 , , , cord-260054-iihgc5nr 57 34 continuing continue VBG cord-260054-iihgc5nr 57 35 with with IN cord-260054-iihgc5nr 57 36 molecular molecular JJ cord-260054-iihgc5nr 57 37 dynamics dynamic NNS cord-260054-iihgc5nr 57 38 with with IN cord-260054-iihgc5nr 57 39 simulated simulated JJ cord-260054-iihgc5nr 57 40 annealing annealing NN cord-260054-iihgc5nr 57 41 and and CC cord-260054-iihgc5nr 57 42 finishing finish VBG cord-260054-iihgc5nr 57 43 again again RB cord-260054-iihgc5nr 57 44 by by IN cord-260054-iihgc5nr 57 45 conjugate conjugate NN cord-260054-iihgc5nr 57 46 gradients gradient NNS cord-260054-iihgc5nr 57 47 . . . cord-260054-iihgc5nr 58 1 The the DT cord-260054-iihgc5nr 58 2 used use VBN cord-260054-iihgc5nr 58 3 force force NN cord-260054-iihgc5nr 58 4 field field NN cord-260054-iihgc5nr 58 5 is be VBZ cord-260054-iihgc5nr 58 6 CHARM-22 charm-22 NN cord-260054-iihgc5nr 58 7 , , , cord-260054-iihgc5nr 58 8 for for IN cord-260054-iihgc5nr 58 9 details detail NNS cord-260054-iihgc5nr 58 10 see see VBP cord-260054-iihgc5nr 58 11 Reference reference NN cord-260054-iihgc5nr 58 12 ( ( -LRB- cord-260054-iihgc5nr 58 13 34 34 CD cord-260054-iihgc5nr 58 14 ) ) -RRB- cord-260054-iihgc5nr 58 15 . . . 
cord-260054-iihgc5nr 59 1 Models model NNS cord-260054-iihgc5nr 59 2 for for IN cord-260054-iihgc5nr 59 3 mutants mutant NNS cord-260054-iihgc5nr 59 4 in in IN cord-260054-iihgc5nr 59 5 the the DT cord-260054-iihgc5nr 59 6 pre pre JJ cord-260054-iihgc5nr 59 7 - - JJ cord-260054-iihgc5nr 59 8 fusion fusion JJ cord-260054-iihgc5nr 59 9 conformation conformation NN cord-260054-iihgc5nr 59 10 were be VBD cord-260054-iihgc5nr 59 11 built build VBN cord-260054-iihgc5nr 59 12 starting start VBG cord-260054-iihgc5nr 59 13 from from IN cord-260054-iihgc5nr 59 14 the the DT cord-260054-iihgc5nr 59 15 EM EM NNP cord-260054-iihgc5nr 59 16 structure structure NN cord-260054-iihgc5nr 59 17 of of IN cord-260054-iihgc5nr 59 18 the the DT cord-260054-iihgc5nr 59 19 pre pre JJ cord-260054-iihgc5nr 59 20 - - JJ cord-260054-iihgc5nr 59 21 fusion fusion JJ cord-260054-iihgc5nr 59 22 trimeric trimeric JJ cord-260054-iihgc5nr 59 23 conformation conformation NN cord-260054-iihgc5nr 59 24 ( ( -LRB- cord-260054-iihgc5nr 59 25 PDB PDB NNP cord-260054-iihgc5nr 59 26 ID ID NNP cord-260054-iihgc5nr 59 27 : : : cord-260054-iihgc5nr 59 28 6VSB 6VSB NNP cord-260054-iihgc5nr 59 29 , , , cord-260054-iihgc5nr 59 30 resolution resolution NN cord-260054-iihgc5nr 59 31 3.46 3.46 CD cord-260054-iihgc5nr 59 32 Å å NN cord-260054-iihgc5nr 59 33 , , , cord-260054-iihgc5nr 59 34 ( ( -LRB- cord-260054-iihgc5nr 59 35 22 22 CD cord-260054-iihgc5nr 59 36 ) ) -RRB- cord-260054-iihgc5nr 59 37 ) ) -RRB- cord-260054-iihgc5nr 59 38 . . . 
cord-260054-iihgc5nr 60 1 Models model NNS cord-260054-iihgc5nr 60 2 for for IN cord-260054-iihgc5nr 60 3 mutants mutant NNS cord-260054-iihgc5nr 60 4 in in IN cord-260054-iihgc5nr 60 5 the the DT cord-260054-iihgc5nr 60 6 post post JJ cord-260054-iihgc5nr 60 7 - - JJ cord-260054-iihgc5nr 60 8 fusion fusion JJ cord-260054-iihgc5nr 60 9 conformation conformation NN cord-260054-iihgc5nr 60 10 were be VBD cord-260054-iihgc5nr 60 11 built build VBN cord-260054-iihgc5nr 60 12 starting start VBG cord-260054-iihgc5nr 60 13 from from IN cord-260054-iihgc5nr 60 14 the the DT cord-260054-iihgc5nr 60 15 X x NN cord-260054-iihgc5nr 60 16 - - NN cord-260054-iihgc5nr 60 17 ray ray JJ cord-260054-iihgc5nr 60 18 structure structure NN cord-260054-iihgc5nr 60 19 of of IN cord-260054-iihgc5nr 60 20 the the DT cord-260054-iihgc5nr 60 21 S2 S2 NNP cord-260054-iihgc5nr 60 22 subunit subunit NN cord-260054-iihgc5nr 60 23 fusion fusion NN cord-260054-iihgc5nr 60 24 core core NN cord-260054-iihgc5nr 60 25 ( ( -LRB- cord-260054-iihgc5nr 60 26 PDB PDB NNP cord-260054-iihgc5nr 60 27 ID ID NNP cord-260054-iihgc5nr 60 28 : : : cord-260054-iihgc5nr 60 29 6LXT 6LXT NNP cord-260054-iihgc5nr 60 30 , , , cord-260054-iihgc5nr 60 31 resolution resolution NN cord-260054-iihgc5nr 60 32 2.90 2.90 CD cord-260054-iihgc5nr 60 33 Å å NN cord-260054-iihgc5nr 60 34 , , , cord-260054-iihgc5nr 60 35 ( ( -LRB- cord-260054-iihgc5nr 60 36 29 29 CD cord-260054-iihgc5nr 60 37 ) ) -RRB- cord-260054-iihgc5nr 60 38 ) ) -RRB- cord-260054-iihgc5nr 60 39 . . . 
cord-260054-iihgc5nr 61 1 Molecular molecular JJ cord-260054-iihgc5nr 61 2 models model NNS cord-260054-iihgc5nr 61 3 were be VBD cord-260054-iihgc5nr 61 4 analysed analyse VBN cord-260054-iihgc5nr 61 5 and and CC cord-260054-iihgc5nr 61 6 visually visually RB cord-260054-iihgc5nr 61 7 inspected inspect VBN cord-260054-iihgc5nr 61 8 with with IN cord-260054-iihgc5nr 61 9 Pymol Pymol NNP cord-260054-iihgc5nr 61 10 ( ( -LRB- cord-260054-iihgc5nr 61 11 35 35 CD cord-260054-iihgc5nr 61 12 ) ) -RRB- cord-260054-iihgc5nr 61 13 . . . cord-260054-iihgc5nr 62 1 The the DT cord-260054-iihgc5nr 62 2 COCOMAPS COCOMAPS NNP cord-260054-iihgc5nr 62 3 web web NN cord-260054-iihgc5nr 62 4 server server NN cord-260054-iihgc5nr 62 5 ( ( -LRB- cord-260054-iihgc5nr 62 6 36 36 CD cord-260054-iihgc5nr 62 7 ) ) -RRB- cord-260054-iihgc5nr 62 8 was be VBD cord-260054-iihgc5nr 62 9 used use VBN cord-260054-iihgc5nr 62 10 to to TO cord-260054-iihgc5nr 62 11 analyse analyse VB cord-260054-iihgc5nr 62 12 the the DT cord-260054-iihgc5nr 62 13 inter inter JJ cord-260054-iihgc5nr 62 14 - - JJ cord-260054-iihgc5nr 62 15 chain chain JJ cord-260054-iihgc5nr 62 16 contacts contact NNS cord-260054-iihgc5nr 62 17 and and CC cord-260054-iihgc5nr 62 18 Hbonds Hbonds NNPS cord-260054-iihgc5nr 62 19 as as RB cord-260054-iihgc5nr 62 20 well well RB cord-260054-iihgc5nr 62 21 as as IN cord-260054-iihgc5nr 62 22 the the DT cord-260054-iihgc5nr 62 23 residues residue NNS cord-260054-iihgc5nr 62 24 accessibility accessibility NN cord-260054-iihgc5nr 62 25 to to IN cord-260054-iihgc5nr 62 26 the the DT cord-260054-iihgc5nr 62 27 solvent solvent NN cord-260054-iihgc5nr 62 28 . . . 
cord-260054-iihgc5nr 63 1 We -PRON- PRP cord-260054-iihgc5nr 63 2 downloaded download VBD cord-260054-iihgc5nr 63 3 all all PDT cord-260054-iihgc5nr 63 4 the the DT cord-260054-iihgc5nr 63 5 SARS SARS NNP cord-260054-iihgc5nr 63 6 - - HYPH cord-260054-iihgc5nr 63 7 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 63 8 genomic genomic JJ cord-260054-iihgc5nr 63 9 sequences sequence NNS cord-260054-iihgc5nr 63 10 from from IN cord-260054-iihgc5nr 63 11 the the DT cord-260054-iihgc5nr 63 12 GISAID GISAID NNP cord-260054-iihgc5nr 63 13 resource resource NN cord-260054-iihgc5nr 63 14 on on IN cord-260054-iihgc5nr 63 15 April April NNP cord-260054-iihgc5nr 63 16 21 21 CD cord-260054-iihgc5nr 63 17 st st NNP cord-260054-iihgc5nr 63 18 2020 2020 CD cord-260054-iihgc5nr 63 19 , , , cord-260054-iihgc5nr 63 20 extracted extract VBN cord-260054-iihgc5nr 63 21 from from IN cord-260054-iihgc5nr 63 22 them -PRON- PRP cord-260054-iihgc5nr 63 23 7,692 7,692 CD cord-260054-iihgc5nr 63 24 complete complete JJ cord-260054-iihgc5nr 63 25 S S NNP cord-260054-iihgc5nr 63 26 protein protein NN cord-260054-iihgc5nr 63 27 sequences sequence NNS cord-260054-iihgc5nr 63 28 and and CC cord-260054-iihgc5nr 63 29 identified identify VBD cord-260054-iihgc5nr 63 30 all all PDT cord-260054-iihgc5nr 63 31 the the DT cord-260054-iihgc5nr 63 32 point point NN cord-260054-iihgc5nr 63 33 mutations mutation NNS cord-260054-iihgc5nr 63 34 occurring occur VBG cord-260054-iihgc5nr 63 35 in in IN cord-260054-iihgc5nr 63 36 at at RB cord-260054-iihgc5nr 63 37 least least RBS cord-260054-iihgc5nr 63 38 two two CD cord-260054-iihgc5nr 63 39 identical identical JJ cord-260054-iihgc5nr 63 40 sequences sequence NNS cord-260054-iihgc5nr 63 41 ( ( -LRB- cord-260054-iihgc5nr 63 42 see see VB cord-260054-iihgc5nr 63 43 Methods Methods NNPS cord-260054-iihgc5nr 63 44 ) ) -RRB- cord-260054-iihgc5nr 63 45 . . . 
cord-260054-iihgc5nr 64 1 The the DT cord-260054-iihgc5nr 64 2 111 111 CD cord-260054-iihgc5nr 64 3 mutations mutation NNS cord-260054-iihgc5nr 64 4 we -PRON- PRP cord-260054-iihgc5nr 64 5 identified identify VBD cord-260054-iihgc5nr 64 6 , , , cord-260054-iihgc5nr 64 7 occurring occur VBG cord-260054-iihgc5nr 64 8 at at IN cord-260054-iihgc5nr 64 9 105 105 CD cord-260054-iihgc5nr 64 10 different different JJ cord-260054-iihgc5nr 64 11 positions position NNS cord-260054-iihgc5nr 64 12 spread spread VBP cord-260054-iihgc5nr 64 13 all all RB cord-260054-iihgc5nr 64 14 over over IN cord-260054-iihgc5nr 64 15 the the DT cord-260054-iihgc5nr 64 16 protein protein NN cord-260054-iihgc5nr 64 17 sequence sequence NN cord-260054-iihgc5nr 64 18 , , , cord-260054-iihgc5nr 64 19 are be VBP cord-260054-iihgc5nr 64 20 reported report VBN cord-260054-iihgc5nr 64 21 in in IN cord-260054-iihgc5nr 64 22 Table Table NNP cord-260054-iihgc5nr 64 23 S1 S1 NNP cord-260054-iihgc5nr 64 24 , , , cord-260054-iihgc5nr 64 25 with with IN cord-260054-iihgc5nr 64 26 the the DT cord-260054-iihgc5nr 64 27 relative relative JJ cord-260054-iihgc5nr 64 28 number number NN cord-260054-iihgc5nr 64 29 of of IN cord-260054-iihgc5nr 64 30 occurrences occurrence NNS cord-260054-iihgc5nr 64 31 . . . 
cord-260054-iihgc5nr 65 1 While while IN cord-260054-iihgc5nr 65 2 the the DT cord-260054-iihgc5nr 65 3 mutations mutation NNS cord-260054-iihgc5nr 65 4 we -PRON- PRP cord-260054-iihgc5nr 65 5 identified identify VBD cord-260054-iihgc5nr 65 6 were be VBD cord-260054-iihgc5nr 65 7 spaced space VBN cord-260054-iihgc5nr 65 8 on on IN cord-260054-iihgc5nr 65 9 average average JJ cord-260054-iihgc5nr 65 10 12 12 CD cord-260054-iihgc5nr 65 11 positions position NNS cord-260054-iihgc5nr 65 12 along along IN cord-260054-iihgc5nr 65 13 the the DT cord-260054-iihgc5nr 65 14 protein protein NN cord-260054-iihgc5nr 65 15 sequence sequence NN cord-260054-iihgc5nr 65 16 , , , cord-260054-iihgc5nr 65 17 a a DT cord-260054-iihgc5nr 65 18 segment segment NN cord-260054-iihgc5nr 65 19 of of IN cord-260054-iihgc5nr 65 20 14 14 CD cord-260054-iihgc5nr 65 21 amino amino JJ cord-260054-iihgc5nr 65 22 acids acid NNS cord-260054-iihgc5nr 65 23 harboured harbour VBD cord-260054-iihgc5nr 65 24 6 6 CD cord-260054-iihgc5nr 65 25 mutations mutation NNS cord-260054-iihgc5nr 65 26 , , , cord-260054-iihgc5nr 65 27 at at IN cord-260054-iihgc5nr 65 28 positions position NNS cord-260054-iihgc5nr 65 29 929 929 CD cord-260054-iihgc5nr 65 30 , , , cord-260054-iihgc5nr 65 31 936 936 CD cord-260054-iihgc5nr 65 32 , , , cord-260054-iihgc5nr 65 33 938 938 CD cord-260054-iihgc5nr 65 34 , , , cord-260054-iihgc5nr 65 35 939 939 CD cord-260054-iihgc5nr 65 36 , , , cord-260054-iihgc5nr 65 37 940 940 CD cord-260054-iihgc5nr 65 38 and and CC cord-260054-iihgc5nr 65 39 943 943 CD cord-260054-iihgc5nr 65 40 , , , cord-260054-iihgc5nr 65 41 proposing propose VBG cord-260054-iihgc5nr 65 42 itself -PRON- PRP cord-260054-iihgc5nr 65 43 as as IN cord-260054-iihgc5nr 65 44 a a DT cord-260054-iihgc5nr 65 45 mutational mutational JJ cord-260054-iihgc5nr 65 46 hotspot hotspot NN cord-260054-iihgc5nr 65 47 . . . 
cord-260054-iihgc5nr 66 1 This this DT cord-260054-iihgc5nr 66 2 sequence sequence NN cord-260054-iihgc5nr 66 3 segment segment NN cord-260054-iihgc5nr 66 4 is be VBZ cord-260054-iihgc5nr 66 5 part part NN cord-260054-iihgc5nr 66 6 of of IN cord-260054-iihgc5nr 66 7 the the DT cord-260054-iihgc5nr 66 8 " " `` cord-260054-iihgc5nr 66 9 fusion fusion NN cord-260054-iihgc5nr 66 10 core core NN cord-260054-iihgc5nr 66 11 " " '' cord-260054-iihgc5nr 66 12 of of IN cord-260054-iihgc5nr 66 13 the the DT cord-260054-iihgc5nr 66 14 HR1 HR1 NNP cord-260054-iihgc5nr 66 15 , , , cord-260054-iihgc5nr 66 16 in in IN cord-260054-iihgc5nr 66 17 the the DT cord-260054-iihgc5nr 66 18 protein protein NN cord-260054-iihgc5nr 66 19 S2 S2 NNP cord-260054-iihgc5nr 66 20 subunit subunit NN cord-260054-iihgc5nr 66 21 . . . cord-260054-iihgc5nr 67 1 The the DT cord-260054-iihgc5nr 67 2 HR1 HR1 NNP cord-260054-iihgc5nr 67 3 of of IN cord-260054-iihgc5nr 67 4 coronaviruses coronaviruse NNS cord-260054-iihgc5nr 68 1 S s NN cord-260054-iihgc5nr 68 2 proteins protein NNS cord-260054-iihgc5nr 68 3 undergoes undergo VBZ cord-260054-iihgc5nr 68 4 one one CD cord-260054-iihgc5nr 68 5 of of IN cord-260054-iihgc5nr 68 6 the the DT cord-260054-iihgc5nr 68 7 most most RBS cord-260054-iihgc5nr 68 8 notable notable JJ cord-260054-iihgc5nr 68 9 rearrangements rearrangement NNS cord-260054-iihgc5nr 68 10 within within IN cord-260054-iihgc5nr 68 11 the the DT cord-260054-iihgc5nr 68 12 protein protein NN cord-260054-iihgc5nr 68 13 between between IN cord-260054-iihgc5nr 68 14 the the DT cord-260054-iihgc5nr 68 15 pre pre NN cord-260054-iihgc5nr 68 16 - - JJ cord-260054-iihgc5nr 68 17 and and CC cord-260054-iihgc5nr 68 18 post post JJ cord-260054-iihgc5nr 68 19 - - JJ cord-260054-iihgc5nr 68 20 fusion fusion JJ cord-260054-iihgc5nr 68 21 conformations conformation NNS cord-260054-iihgc5nr 68 22 . . . 
cord-260054-iihgc5nr 69 1 In in IN cord-260054-iihgc5nr 69 2 the the DT cord-260054-iihgc5nr 69 3 post post JJ cord-260054-iihgc5nr 69 4 - - JJ cord-260054-iihgc5nr 69 5 fusion fusion JJ cord-260054-iihgc5nr 69 6 conformation conformation NN cord-260054-iihgc5nr 69 7 , , , cord-260054-iihgc5nr 69 8 in in IN cord-260054-iihgc5nr 69 9 fact fact NN cord-260054-iihgc5nr 69 10 , , , cord-260054-iihgc5nr 69 11 it -PRON- PRP cord-260054-iihgc5nr 69 12 experiences experience VBZ cord-260054-iihgc5nr 69 13 a a DT cord-260054-iihgc5nr 69 14 refolding refolding NN cord-260054-iihgc5nr 69 15 of of IN cord-260054-iihgc5nr 69 16 the the DT cord-260054-iihgc5nr 69 17 pre pre JJ cord-260054-iihgc5nr 69 18 - - JJ cord-260054-iihgc5nr 69 19 fusion fusion JJ cord-260054-iihgc5nr 69 20 multiple multiple JJ cord-260054-iihgc5nr 69 21 helices helix NNS cord-260054-iihgc5nr 69 22 and and CC cord-260054-iihgc5nr 69 23 intervening intervene VBG cord-260054-iihgc5nr 69 24 regions region NNS cord-260054-iihgc5nr 69 25 into into IN cord-260054-iihgc5nr 69 26 a a DT cord-260054-iihgc5nr 69 27 single single JJ cord-260054-iihgc5nr 69 28 continuous continuous JJ cord-260054-iihgc5nr 69 29 helix helix NN cord-260054-iihgc5nr 69 30 ( ( -LRB- cord-260054-iihgc5nr 69 31 Figure figure NN cord-260054-iihgc5nr 69 32 1 1 CD cord-260054-iihgc5nr 69 33 ) ) -RRB- cord-260054-iihgc5nr 69 34 . . . 
cord-260054-iihgc5nr 70 1 As as IN cord-260054-iihgc5nr 70 2 already already RB cord-260054-iihgc5nr 70 3 mentioned mention VBN cord-260054-iihgc5nr 70 4 , , , cord-260054-iihgc5nr 70 5 three three CD cord-260054-iihgc5nr 70 6 of of IN cord-260054-iihgc5nr 70 7 these these DT cord-260054-iihgc5nr 70 8 long long JJ cord-260054-iihgc5nr 70 9 helices helix NNS cord-260054-iihgc5nr 70 10 then then RB cord-260054-iihgc5nr 70 11 form form VBP cord-260054-iihgc5nr 70 12 a a DT cord-260054-iihgc5nr 70 13 6HB 6hb CD cord-260054-iihgc5nr 70 14 with with IN cord-260054-iihgc5nr 70 15 three three CD cord-260054-iihgc5nr 70 16 HR2 HR2 NNP cord-260054-iihgc5nr 70 17 helical helical JJ cord-260054-iihgc5nr 70 18 motifs motif NNS cord-260054-iihgc5nr 70 19 ( ( -LRB- cord-260054-iihgc5nr 70 20 18 18 CD cord-260054-iihgc5nr 70 21 , , , cord-260054-iihgc5nr 70 22 29 29 CD cord-260054-iihgc5nr 70 23 , , , cord-260054-iihgc5nr 70 24 31 31 CD cord-260054-iihgc5nr 70 25 ) ) -RRB- cord-260054-iihgc5nr 70 26 . . . cord-260054-iihgc5nr 71 1 The the DT cord-260054-iihgc5nr 71 2 HR1 HR1 NNP cord-260054-iihgc5nr 71 3 and and CC cord-260054-iihgc5nr 71 4 its -PRON- PRP$ cord-260054-iihgc5nr 71 5 " " `` cord-260054-iihgc5nr 71 6 fusion fusion NN cord-260054-iihgc5nr 71 7 core core NN cord-260054-iihgc5nr 71 8 " " '' cord-260054-iihgc5nr 71 9 in in IN cord-260054-iihgc5nr 71 10 particular particular JJ cord-260054-iihgc5nr 71 11 thus thus RB cord-260054-iihgc5nr 71 12 play play VB cord-260054-iihgc5nr 71 13 a a DT cord-260054-iihgc5nr 71 14 crucial crucial JJ cord-260054-iihgc5nr 71 15 role role NN cord-260054-iihgc5nr 71 16 in in IN cord-260054-iihgc5nr 71 17 the the DT cord-260054-iihgc5nr 71 18 virus virus NN cord-260054-iihgc5nr 71 19 infectivity infectivity NN cord-260054-iihgc5nr 71 20 . . . 
cord-260054-iihgc5nr 72 1 The the DT cord-260054-iihgc5nr 72 2 following follow VBG cord-260054-iihgc5nr 72 3 6 6 CD cord-260054-iihgc5nr 72 4 mutations mutation NNS cord-260054-iihgc5nr 72 5 were be VBD cord-260054-iihgc5nr 72 6 identified identify VBN cord-260054-iihgc5nr 72 7 in in IN cord-260054-iihgc5nr 72 8 the the DT cord-260054-iihgc5nr 72 9 fusion fusion NN cord-260054-iihgc5nr 72 10 core core NN cord-260054-iihgc5nr 72 11 of of IN cord-260054-iihgc5nr 72 12 the the DT cord-260054-iihgc5nr 72 13 HR1 HR1 NNP cord-260054-iihgc5nr 72 14 on on IN cord-260054-iihgc5nr 72 15 April April NNP cord-260054-iihgc5nr 72 16 21 21 CD cord-260054-iihgc5nr 72 17 st st NNP cord-260054-iihgc5nr 72 18 2020 2020 CD cord-260054-iihgc5nr 72 19 : : : cord-260054-iihgc5nr 72 20 S929I S929I NNP cord-260054-iihgc5nr 72 21 , , , cord-260054-iihgc5nr 72 22 D936Y D936Y NNP cord-260054-iihgc5nr 72 23 , , , cord-260054-iihgc5nr 72 24 L938F L938F NNP cord-260054-iihgc5nr 72 25 , , , cord-260054-iihgc5nr 72 26 S939F S939F NNP cord-260054-iihgc5nr 72 27 , , , cord-260054-iihgc5nr 72 28 S940F S940F NNP cord-260054-iihgc5nr 72 29 , , , cord-260054-iihgc5nr 72 30 S943P. S943P. 
NNP cord-260054-iihgc5nr 72 31 Two two CD cord-260054-iihgc5nr 72 32 of of IN cord-260054-iihgc5nr 72 33 these these DT cord-260054-iihgc5nr 72 34 mutations mutation NNS cord-260054-iihgc5nr 72 35 , , , cord-260054-iihgc5nr 72 36 D936Y D936Y NNP cord-260054-iihgc5nr 72 37 and and CC cord-260054-iihgc5nr 72 38 S943P S943P NNP cord-260054-iihgc5nr 72 39 , , , cord-260054-iihgc5nr 72 40 were be VBD cord-260054-iihgc5nr 72 41 among among IN cord-260054-iihgc5nr 72 42 the the DT cord-260054-iihgc5nr 72 43 most most RBS cord-260054-iihgc5nr 72 44 frequent frequent JJ cord-260054-iihgc5nr 72 45 in in IN cord-260054-iihgc5nr 72 46 the the DT cord-260054-iihgc5nr 72 47 ensemble ensemble NN cord-260054-iihgc5nr 72 48 of of IN cord-260054-iihgc5nr 72 49 mutations mutation NNS cord-260054-iihgc5nr 72 50 we -PRON- PRP cord-260054-iihgc5nr 72 51 identified identify VBD cord-260054-iihgc5nr 72 52 . . . cord-260054-iihgc5nr 73 1 Besides besides IN cord-260054-iihgc5nr 73 2 the the DT cord-260054-iihgc5nr 73 3 widespread widespread JJ cord-260054-iihgc5nr 73 4 D614 d614 NN cord-260054-iihgc5nr 73 5 G g NN cord-260054-iihgc5nr 73 6 , , , cord-260054-iihgc5nr 73 7 now now RB cord-260054-iihgc5nr 73 8 dominant dominant JJ cord-260054-iihgc5nr 73 9 over over IN cord-260054-iihgc5nr 73 10 the the DT cord-260054-iihgc5nr 73 11 original original JJ cord-260054-iihgc5nr 73 12 D614 d614 NN cord-260054-iihgc5nr 73 13 variant variant NN cord-260054-iihgc5nr 73 14 ( ( -LRB- cord-260054-iihgc5nr 73 15 37 37 CD cord-260054-iihgc5nr 73 16 , , , cord-260054-iihgc5nr 73 17 38 38 CD cord-260054-iihgc5nr 73 18 ) ) -RRB- cord-260054-iihgc5nr 73 19 , , , cord-260054-iihgc5nr 73 20 only only RB cord-260054-iihgc5nr 73 21 5 5 CD cord-260054-iihgc5nr 73 22 other other JJ cord-260054-iihgc5nr 73 23 mutations mutation NNS cord-260054-iihgc5nr 73 24 ( ( -LRB- cord-260054-iihgc5nr 73 25 two two CD cord-260054-iihgc5nr 73 26 of of IN cord-260054-iihgc5nr 73 27 them -PRON- PRP cord-260054-iihgc5nr 73 28 being 
be VBG cord-260054-iihgc5nr 73 29 very very RB cord-260054-iihgc5nr 73 30 peripheral peripheral JJ cord-260054-iihgc5nr 73 31 , , , cord-260054-iihgc5nr 73 32 L5F l5f NN cord-260054-iihgc5nr 73 33 and and CC cord-260054-iihgc5nr 73 34 P1263L P1263L NNP cord-260054-iihgc5nr 73 35 ) ) -RRB- cord-260054-iihgc5nr 73 36 recurred recur VBD cord-260054-iihgc5nr 73 37 indeed indeed RB cord-260054-iihgc5nr 73 38 in in IN cord-260054-iihgc5nr 73 39 ≥ ≥ CD cord-260054-iihgc5nr 73 40 20 20 CD cord-260054-iihgc5nr 73 41 sequences sequence NNS cord-260054-iihgc5nr 73 42 ( ( -LRB- cord-260054-iihgc5nr 73 43 see see VB cord-260054-iihgc5nr 73 44 Table Table NNP cord-260054-iihgc5nr 73 45 S1 S1 NNP cord-260054-iihgc5nr 73 46 ) ) -RRB- cord-260054-iihgc5nr 73 47 . . . cord-260054-iihgc5nr 74 1 S943P S943P NNP cord-260054-iihgc5nr 74 2 was be VBD cord-260054-iihgc5nr 74 3 also also RB cord-260054-iihgc5nr 74 4 reported report VBN cord-260054-iihgc5nr 74 5 in in IN cord-260054-iihgc5nr 74 6 ( ( -LRB- cord-260054-iihgc5nr 74 7 38 38 CD cord-260054-iihgc5nr 74 8 ) ) -RRB- cord-260054-iihgc5nr 74 9 , , , cord-260054-iihgc5nr 74 10 where where WRB cord-260054-iihgc5nr 74 11 it -PRON- PRP cord-260054-iihgc5nr 74 12 was be VBD cord-260054-iihgc5nr 74 13 hypothesized hypothesize VBN cord-260054-iihgc5nr 74 14 to to TO cord-260054-iihgc5nr 74 15 be be VB cord-260054-iihgc5nr 74 16 spreading spread VBG cord-260054-iihgc5nr 74 17 via via IN cord-260054-iihgc5nr 74 18 recombination recombination NN cord-260054-iihgc5nr 74 19 . . . 
cord-260054-iihgc5nr 75 1 The the DT cord-260054-iihgc5nr 75 2 L938F L938F NNP cord-260054-iihgc5nr 75 3 mutation mutation NN cord-260054-iihgc5nr 75 4 was be VBD cord-260054-iihgc5nr 75 5 a a DT cord-260054-iihgc5nr 75 6 particularly particularly RB cord-260054-iihgc5nr 75 7 late late JJ cord-260054-iihgc5nr 75 8 one one CD cord-260054-iihgc5nr 75 9 ; ; : cord-260054-iihgc5nr 75 10 it -PRON- PRP cord-260054-iihgc5nr 75 11 was be VBD cord-260054-iihgc5nr 75 12 found find VBN cord-260054-iihgc5nr 75 13 in in IN cord-260054-iihgc5nr 75 14 2 2 CD cord-260054-iihgc5nr 75 15 sequences sequence NNS cord-260054-iihgc5nr 75 16 , , , cord-260054-iihgc5nr 75 17 associated associate VBN cord-260054-iihgc5nr 75 18 to to IN cord-260054-iihgc5nr 75 19 the the DT cord-260054-iihgc5nr 75 20 D614 d614 NN cord-260054-iihgc5nr 75 21 G g NN cord-260054-iihgc5nr 75 22 mutation mutation NN cord-260054-iihgc5nr 75 23 , , , cord-260054-iihgc5nr 75 24 both both CC cord-260054-iihgc5nr 75 25 from from IN cord-260054-iihgc5nr 75 26 England England NNP cord-260054-iihgc5nr 75 27 and and CC cord-260054-iihgc5nr 75 28 dated date VBN cord-260054-iihgc5nr 75 29 March March NNP cord-260054-iihgc5nr 75 30 29 29 CD cord-260054-iihgc5nr 75 31 th th XX cord-260054-iihgc5nr 75 32 . . . 
cord-260054-iihgc5nr 76 1 The the DT cord-260054-iihgc5nr 76 2 S929I S929I NNP cord-260054-iihgc5nr 76 3 mutation mutation NN cord-260054-iihgc5nr 76 4 was be VBD cord-260054-iihgc5nr 76 5 found find VBN cord-260054-iihgc5nr 76 6 in in IN cord-260054-iihgc5nr 76 7 2 2 CD cord-260054-iihgc5nr 76 8 sequences sequence NNS cord-260054-iihgc5nr 76 9 from from IN cord-260054-iihgc5nr 76 10 USA USA NNP cord-260054-iihgc5nr 76 11 ( ( -LRB- cord-260054-iihgc5nr 76 12 Washington Washington NNP cord-260054-iihgc5nr 76 13 ) ) -RRB- cord-260054-iihgc5nr 76 14 , , , cord-260054-iihgc5nr 76 15 dated date VBN cord-260054-iihgc5nr 76 16 March March NNP cord-260054-iihgc5nr 76 17 12 12 CD cord-260054-iihgc5nr 76 18 th th XX cord-260054-iihgc5nr 76 19 and and CC cord-260054-iihgc5nr 76 20 27 27 CD cord-260054-iihgc5nr 76 21 th th XX cord-260054-iihgc5nr 76 22 , , , cord-260054-iihgc5nr 76 23 associated associate VBN cord-260054-iihgc5nr 76 24 to to IN cord-260054-iihgc5nr 76 25 the the DT cord-260054-iihgc5nr 76 26 D614 d614 NN cord-260054-iihgc5nr 76 27 G g NN cord-260054-iihgc5nr 76 28 mutation mutation NN cord-260054-iihgc5nr 76 29 . . . 
cord-260054-iihgc5nr 77 1 Finally finally RB cord-260054-iihgc5nr 77 2 , , , cord-260054-iihgc5nr 77 3 the the DT cord-260054-iihgc5nr 77 4 S940F S940F NNP cord-260054-iihgc5nr 77 5 mutation mutation NN cord-260054-iihgc5nr 77 6 had have VBD cord-260054-iihgc5nr 77 7 a a DT cord-260054-iihgc5nr 77 8 unique unique JJ cord-260054-iihgc5nr 77 9 geographical geographical JJ cord-260054-iihgc5nr 77 10 distribution distribution NN cord-260054-iihgc5nr 77 11 , , , cord-260054-iihgc5nr 77 12 as as IN cord-260054-iihgc5nr 77 13 it -PRON- PRP cord-260054-iihgc5nr 77 14 was be VBD cord-260054-iihgc5nr 77 15 found find VBN cord-260054-iihgc5nr 77 16 in in IN cord-260054-iihgc5nr 77 17 2 2 CD cord-260054-iihgc5nr 77 18 sequences sequence NNS cord-260054-iihgc5nr 77 19 from from IN cord-260054-iihgc5nr 77 20 Australia Australia NNP cord-260054-iihgc5nr 77 21 ( ( -LRB- cord-260054-iihgc5nr 77 22 New New NNP cord-260054-iihgc5nr 77 23 South South NNP cord-260054-iihgc5nr 77 24 Wales Wales NNP cord-260054-iihgc5nr 77 25 ) ) -RRB- cord-260054-iihgc5nr 77 26 dated date VBD cord-260054-iihgc5nr 77 27 February February NNP cord-260054-iihgc5nr 77 28 28 28 CD cord-260054-iihgc5nr 77 29 th th XX cord-260054-iihgc5nr 77 30 and and CC cord-260054-iihgc5nr 77 31 March March NNP cord-260054-iihgc5nr 77 32 4 4 CD cord-260054-iihgc5nr 77 33 th th XX cord-260054-iihgc5nr 77 34 , , , cord-260054-iihgc5nr 77 35 not not RB cord-260054-iihgc5nr 77 36 associated associate VBN cord-260054-iihgc5nr 77 37 to to IN cord-260054-iihgc5nr 77 38 the the DT cord-260054-iihgc5nr 77 39 D614 d614 NN cord-260054-iihgc5nr 77 40 G g NN cord-260054-iihgc5nr 77 41 mutation mutation NN cord-260054-iihgc5nr 77 42 . . . 
cord-260054-iihgc5nr 78 1 In in IN cord-260054-iihgc5nr 78 2 addition addition NN cord-260054-iihgc5nr 78 3 , , , cord-260054-iihgc5nr 78 4 it -PRON- PRP cord-260054-iihgc5nr 78 5 was be VBD cord-260054-iihgc5nr 78 6 found find VBN cord-260054-iihgc5nr 78 7 in in IN cord-260054-iihgc5nr 78 8 1 1 CD cord-260054-iihgc5nr 78 9 single single JJ cord-260054-iihgc5nr 78 10 sequence sequence NN cord-260054-iihgc5nr 78 11 from from IN cord-260054-iihgc5nr 78 12 France France NNP cord-260054-iihgc5nr 78 13 , , , cord-260054-iihgc5nr 78 14 dated date VBN cord-260054-iihgc5nr 78 15 March March NNP cord-260054-iihgc5nr 78 16 20 20 CD cord-260054-iihgc5nr 78 17 th th XX cord-260054-iihgc5nr 78 18 , , , cord-260054-iihgc5nr 78 19 where where WRB cord-260054-iihgc5nr 78 20 it -PRON- PRP cord-260054-iihgc5nr 78 21 was be VBD cord-260054-iihgc5nr 78 22 associated associate VBN cord-260054-iihgc5nr 78 23 to to IN cord-260054-iihgc5nr 78 24 the the DT cord-260054-iihgc5nr 78 25 D614 d614 NN cord-260054-iihgc5nr 78 26 G g NN cord-260054-iihgc5nr 78 27 mutation mutation NN cord-260054-iihgc5nr 78 28 . . . 
cord-260054-iihgc5nr 79 1 In in IN cord-260054-iihgc5nr 79 2 conclusion conclusion NN cord-260054-iihgc5nr 79 3 , , , cord-260054-iihgc5nr 79 4 with with IN cord-260054-iihgc5nr 79 5 the the DT cord-260054-iihgc5nr 79 6 exception exception NN cord-260054-iihgc5nr 79 7 of of IN cord-260054-iihgc5nr 79 8 S940F S940F NNP cord-260054-iihgc5nr 79 9 , , , cord-260054-iihgc5nr 79 10 which which WDT cord-260054-iihgc5nr 79 11 was be VBD cord-260054-iihgc5nr 79 12 found find VBN cord-260054-iihgc5nr 79 13 in in IN cord-260054-iihgc5nr 79 14 Australia Australia NNP cord-260054-iihgc5nr 79 15 , , , cord-260054-iihgc5nr 79 16 all all PDT cord-260054-iihgc5nr 79 17 the the DT cord-260054-iihgc5nr 79 18 mutations mutation NNS cord-260054-iihgc5nr 79 19 in in IN cord-260054-iihgc5nr 79 20 the the DT cord-260054-iihgc5nr 79 21 HR1 HR1 NNP cord-260054-iihgc5nr 79 22 core core NN cord-260054-iihgc5nr 79 23 fusion fusion NN cord-260054-iihgc5nr 79 24 were be VBD cord-260054-iihgc5nr 79 25 spread spread VBN cord-260054-iihgc5nr 79 26 in in IN cord-260054-iihgc5nr 79 27 two two CD cord-260054-iihgc5nr 79 28 continents continent NNS cord-260054-iihgc5nr 79 29 , , , cord-260054-iihgc5nr 79 30 Europe Europe NNP cord-260054-iihgc5nr 79 31 and/or and/or CC cord-260054-iihgc5nr 79 32 North North NNP cord-260054-iihgc5nr 79 33 America America NNP cord-260054-iihgc5nr 79 34 . . . cord-260054-iihgc5nr 80 1 Furthermore furthermore RB cord-260054-iihgc5nr 80 2 , , , cord-260054-iihgc5nr 80 3 most most JJS cord-260054-iihgc5nr 80 4 of of IN cord-260054-iihgc5nr 80 5 them -PRON- PRP cord-260054-iihgc5nr 80 6 originated originate VBD cord-260054-iihgc5nr 80 7 from from IN cord-260054-iihgc5nr 80 8 the the DT cord-260054-iihgc5nr 80 9 D614 d614 CD cord-260054-iihgc5nr 80 10 G g NN cord-260054-iihgc5nr 80 11 variant variant NN cord-260054-iihgc5nr 80 12 . . . 
cord-260054-iihgc5nr 81 1 This this DT cord-260054-iihgc5nr 81 2 is be VBZ cord-260054-iihgc5nr 81 3 in in IN cord-260054-iihgc5nr 81 4 agreement agreement NN cord-260054-iihgc5nr 81 5 with with IN cord-260054-iihgc5nr 81 6 them -PRON- PRP cord-260054-iihgc5nr 81 7 seeming seeming JJ cord-260054-iihgc5nr 81 8 to to TO cord-260054-iihgc5nr 81 9 be be VB cord-260054-iihgc5nr 81 10 quite quite RB cord-260054-iihgc5nr 81 11 late late JJ cord-260054-iihgc5nr 81 12 mutations mutation NNS cord-260054-iihgc5nr 81 13 , , , cord-260054-iihgc5nr 81 14 sequenced sequence VBN cord-260054-iihgc5nr 81 15 starting start VBG cord-260054-iihgc5nr 81 16 from from IN cord-260054-iihgc5nr 81 17 the the DT cord-260054-iihgc5nr 81 18 end end NN cord-260054-iihgc5nr 81 19 of of IN cord-260054-iihgc5nr 81 20 February February NNP cord-260054-iihgc5nr 81 21 / / SYM cord-260054-iihgc5nr 81 22 March March NNP cord-260054-iihgc5nr 81 23 2020 2020 CD cord-260054-iihgc5nr 81 24 , , , cord-260054-iihgc5nr 81 25 i.e. i.e. 
FW cord-260054-iihgc5nr 81 26 over over IN cord-260054-iihgc5nr 81 27 two two CD cord-260054-iihgc5nr 81 28 months month NNS cord-260054-iihgc5nr 81 29 after after IN cord-260054-iihgc5nr 81 30 the the DT cord-260054-iihgc5nr 81 31 first first JJ cord-260054-iihgc5nr 81 32 Wuhan Wuhan NNP cord-260054-iihgc5nr 81 33 variant variant NN cord-260054-iihgc5nr 81 34 dated date VBN cord-260054-iihgc5nr 81 35 December December NNP cord-260054-iihgc5nr 81 36 24 24 CD cord-260054-iihgc5nr 81 37 th th XX cord-260054-iihgc5nr 81 38 2019 2019 CD cord-260054-iihgc5nr 81 39 ( ( -LRB- cord-260054-iihgc5nr 81 40 30 30 CD cord-260054-iihgc5nr 81 41 ) ) -RRB- cord-260054-iihgc5nr 82 1 ( ( -LRB- cord-260054-iihgc5nr 82 2 Table table NN cord-260054-iihgc5nr 82 3 1 1 CD cord-260054-iihgc5nr 82 4 ) ) -RRB- cord-260054-iihgc5nr 83 1 The the DT cord-260054-iihgc5nr 83 2 ≈3-fold ≈3-fold NNP cord-260054-iihgc5nr 83 3 increment increment NN cord-260054-iihgc5nr 83 4 in in IN cord-260054-iihgc5nr 83 5 frequency frequency NN cord-260054-iihgc5nr 83 6 of of IN cord-260054-iihgc5nr 83 7 the the DT cord-260054-iihgc5nr 83 8 S929I S929I NNP cord-260054-iihgc5nr 83 9 and and CC cord-260054-iihgc5nr 83 10 S939F S939F NNP cord-260054-iihgc5nr 83 11 mutants mutant NNS cord-260054-iihgc5nr 83 12 was be VBD cord-260054-iihgc5nr 83 13 in in IN cord-260054-iihgc5nr 83 14 line line NN cord-260054-iihgc5nr 83 15 with with IN cord-260054-iihgc5nr 83 16 the the DT cord-260054-iihgc5nr 83 17 increment increment NN cord-260054-iihgc5nr 83 18 of of IN cord-260054-iihgc5nr 83 19 the the DT cord-260054-iihgc5nr 83 20 sequences sequence NNS cord-260054-iihgc5nr 83 21 in in IN cord-260054-iihgc5nr 83 22 the the DT cord-260054-iihgc5nr 83 23 dataset dataset NN cord-260054-iihgc5nr 83 24 . . . 
cord-260054-iihgc5nr 84 1 The the DT cord-260054-iihgc5nr 84 2 three three CD cord-260054-iihgc5nr 84 3 additional additional JJ cord-260054-iihgc5nr 84 4 occurrences occurrence NNS cord-260054-iihgc5nr 84 5 of of IN cord-260054-iihgc5nr 84 6 the the DT cord-260054-iihgc5nr 84 7 S929I S929I NNP cord-260054-iihgc5nr 84 8 mutation mutation NN cord-260054-iihgc5nr 84 9 were be VBD cord-260054-iihgc5nr 84 10 from from IN cord-260054-iihgc5nr 84 11 USA USA NNP cord-260054-iihgc5nr 84 12 ( ( -LRB- cord-260054-iihgc5nr 84 13 Washington Washington NNP cord-260054-iihgc5nr 84 14 ) ) -RRB- cord-260054-iihgc5nr 84 15 , , , cord-260054-iihgc5nr 84 16 Wales Wales NNP cord-260054-iihgc5nr 84 17 and and CC cord-260054-iihgc5nr 84 18 England England NNP cord-260054-iihgc5nr 84 19 . . . cord-260054-iihgc5nr 85 1 A a DT cord-260054-iihgc5nr 85 2 novel novel JJ cord-260054-iihgc5nr 85 3 S929 s929 NN cord-260054-iihgc5nr 85 4 T t NN cord-260054-iihgc5nr 85 5 mutation mutation NN cord-260054-iihgc5nr 85 6 was be VBD cord-260054-iihgc5nr 85 7 also also RB cord-260054-iihgc5nr 85 8 reported report VBN cord-260054-iihgc5nr 85 9 twice twice RB cord-260054-iihgc5nr 85 10 from from IN cord-260054-iihgc5nr 85 11 Scotland Scotland NNP cord-260054-iihgc5nr 85 12 . . . 
cord-260054-iihgc5nr 86 1 Additional additional JJ cord-260054-iihgc5nr 86 2 occurrences occurrence NNS cord-260054-iihgc5nr 86 3 of of IN cord-260054-iihgc5nr 86 4 the the DT cord-260054-iihgc5nr 86 5 S939F S939F NNP cord-260054-iihgc5nr 86 6 mutation mutation NN cord-260054-iihgc5nr 86 7 were be VBD cord-260054-iihgc5nr 86 8 instead instead RB cord-260054-iihgc5nr 86 9 from from IN cord-260054-iihgc5nr 86 10 USA USA NNP cord-260054-iihgc5nr 86 11 , , , cord-260054-iihgc5nr 86 12 7 7 CD cord-260054-iihgc5nr 86 13 , , , cord-260054-iihgc5nr 86 14 England England NNP cord-260054-iihgc5nr 86 15 , , , cord-260054-iihgc5nr 86 16 2 2 CD cord-260054-iihgc5nr 86 17 , , , cord-260054-iihgc5nr 86 18 Kazakhstan Kazakhstan NNP cord-260054-iihgc5nr 86 19 , , , cord-260054-iihgc5nr 86 20 1 1 CD cord-260054-iihgc5nr 86 21 , , , cord-260054-iihgc5nr 86 22 and and CC cord-260054-iihgc5nr 86 23 UAE UAE NNP cord-260054-iihgc5nr 86 24 , , , cord-260054-iihgc5nr 86 25 1 1 CD cord-260054-iihgc5nr 86 26 . . . cord-260054-iihgc5nr 87 1 As as IN cord-260054-iihgc5nr 87 2 for for IN cord-260054-iihgc5nr 87 3 the the DT cord-260054-iihgc5nr 87 4 L938F L938F NNP cord-260054-iihgc5nr 87 5 and and CC cord-260054-iihgc5nr 87 6 S940F S940F NNP cord-260054-iihgc5nr 87 7 mutants mutant NNS cord-260054-iihgc5nr 87 8 , , , cord-260054-iihgc5nr 87 9 their -PRON- PRP$ cord-260054-iihgc5nr 87 10 increment increment NN cord-260054-iihgc5nr 87 11 was be VBD cord-260054-iihgc5nr 87 12 significantly significantly RB cord-260054-iihgc5nr 87 13 lower low JJR cord-260054-iihgc5nr 87 14 than than IN cord-260054-iihgc5nr 87 15 the the DT cord-260054-iihgc5nr 87 16 increment increment NN cord-260054-iihgc5nr 87 17 of of IN cord-260054-iihgc5nr 87 18 the the DT cord-260054-iihgc5nr 87 19 sequences sequence NNS cord-260054-iihgc5nr 87 20 in in IN cord-260054-iihgc5nr 87 21 the the DT cord-260054-iihgc5nr 87 22 dataset dataset NN cord-260054-iihgc5nr 87 23 . . . 
cord-260054-iihgc5nr 88 1 A a DT cord-260054-iihgc5nr 88 2 positive positive JJ cord-260054-iihgc5nr 88 3 selection selection NN cord-260054-iihgc5nr 88 4 thus thus RB cord-260054-iihgc5nr 88 5 clearly clearly RB cord-260054-iihgc5nr 88 6 has have VBZ cord-260054-iihgc5nr 88 7 n't not RB cord-260054-iihgc5nr 88 8 emerged emerge VBN cord-260054-iihgc5nr 88 9 to to IN cord-260054-iihgc5nr 88 10 date date NN cord-260054-iihgc5nr 88 11 for for IN cord-260054-iihgc5nr 88 12 these these DT cord-260054-iihgc5nr 88 13 mutations mutation NNS cord-260054-iihgc5nr 88 14 . . . cord-260054-iihgc5nr 89 1 The the DT cord-260054-iihgc5nr 89 2 only only JJ cord-260054-iihgc5nr 89 3 additional additional JJ cord-260054-iihgc5nr 89 4 occurrence occurrence NN cord-260054-iihgc5nr 89 5 of of IN cord-260054-iihgc5nr 89 6 L938F L938F NNP cord-260054-iihgc5nr 89 7 was be VBD cord-260054-iihgc5nr 89 8 from from IN cord-260054-iihgc5nr 89 9 Denmark Denmark NNP cord-260054-iihgc5nr 89 10 , , , cord-260054-iihgc5nr 89 11 while while IN cord-260054-iihgc5nr 89 12 the the DT cord-260054-iihgc5nr 89 13 2 2 CD cord-260054-iihgc5nr 89 14 additional additional JJ cord-260054-iihgc5nr 89 15 occurrences occurrence NNS cord-260054-iihgc5nr 89 16 of of IN cord-260054-iihgc5nr 89 17 S940F S940F NNPS cord-260054-iihgc5nr 89 18 were be VBD cord-260054-iihgc5nr 89 19 from from IN cord-260054-iihgc5nr 89 20 France France NNP cord-260054-iihgc5nr 89 21 and and CC cord-260054-iihgc5nr 89 22 USA USA NNP cord-260054-iihgc5nr 89 23 ( ( -LRB- cord-260054-iihgc5nr 89 24 Washington Washington NNP cord-260054-iihgc5nr 89 25 ) ) -RRB- cord-260054-iihgc5nr 89 26 . . . cord-260054-iihgc5nr 90 1 The the DT cord-260054-iihgc5nr 90 2 S943P S943P NNP cord-260054-iihgc5nr 90 3 mutation mutation NN cord-260054-iihgc5nr 90 4 represented represent VBD cord-260054-iihgc5nr 90 5 a a DT cord-260054-iihgc5nr 90 6 special special JJ cord-260054-iihgc5nr 90 7 case case NN cord-260054-iihgc5nr 90 8 . . . 
cord-260054-iihgc5nr 91 1 Most Most JJS cord-260054-iihgc5nr 91 2 of of IN cord-260054-iihgc5nr 91 3 the the DT cord-260054-iihgc5nr 91 4 sequences sequence NNS cord-260054-iihgc5nr 91 5 harbouring harbour VBG cord-260054-iihgc5nr 91 6 such such PDT cord-260054-iihgc5nr 91 7 a a DT cord-260054-iihgc5nr 91 8 mutation mutation NN cord-260054-iihgc5nr 91 9 were be VBD cord-260054-iihgc5nr 91 10 indeed indeed RB cord-260054-iihgc5nr 91 11 modified modify VBN cord-260054-iihgc5nr 91 12 between between IN cord-260054-iihgc5nr 91 13 the the DT cord-260054-iihgc5nr 91 14 April April NNP cord-260054-iihgc5nr 91 15 21 21 CD cord-260054-iihgc5nr 91 16 st st NNP cord-260054-iihgc5nr 91 17 and and CC cord-260054-iihgc5nr 91 18 the the DT cord-260054-iihgc5nr 91 19 May May NNP cord-260054-iihgc5nr 91 20 29 29 CD cord-260054-iihgc5nr 91 21 th th XX cord-260054-iihgc5nr 91 22 datasets dataset NNS cord-260054-iihgc5nr 91 23 , , , cord-260054-iihgc5nr 91 24 so so IN cord-260054-iihgc5nr 91 25 that that IN cord-260054-iihgc5nr 91 26 they -PRON- PRP cord-260054-iihgc5nr 91 27 do do VBP cord-260054-iihgc5nr 91 28 not not RB cord-260054-iihgc5nr 91 29 feature feature VB cord-260054-iihgc5nr 91 30 anymore anymore RB cord-260054-iihgc5nr 91 31 the the DT cord-260054-iihgc5nr 91 32 mutation mutation NN cord-260054-iihgc5nr 91 33 to to IN cord-260054-iihgc5nr 91 34 proline proline NN cord-260054-iihgc5nr 91 35 . . . 
cord-260054-iihgc5nr 92 1 However however RB cord-260054-iihgc5nr 92 2 , , , cord-260054-iihgc5nr 92 3 3 3 CD cord-260054-iihgc5nr 92 4 novel novel JJ cord-260054-iihgc5nr 92 5 occurrences occurrence NNS cord-260054-iihgc5nr 92 6 of of IN cord-260054-iihgc5nr 92 7 the the DT cord-260054-iihgc5nr 92 8 same same JJ cord-260054-iihgc5nr 92 9 mutation mutation NN cord-260054-iihgc5nr 92 10 , , , cord-260054-iihgc5nr 92 11 S943P S943P NNP cord-260054-iihgc5nr 92 12 , , , cord-260054-iihgc5nr 92 13 emerged emerge VBD cord-260054-iihgc5nr 92 14 from from IN cord-260054-iihgc5nr 92 15 China China NNP cord-260054-iihgc5nr 92 16 ( ( -LRB- cord-260054-iihgc5nr 92 17 Beijing Beijing NNP cord-260054-iihgc5nr 92 18 ) ) -RRB- cord-260054-iihgc5nr 92 19 . . . cord-260054-iihgc5nr 93 1 In in IN cord-260054-iihgc5nr 93 2 addition addition NN cord-260054-iihgc5nr 93 3 , , , cord-260054-iihgc5nr 93 4 3 3 CD cord-260054-iihgc5nr 93 5 sequences sequence NNS cord-260054-iihgc5nr 93 6 from from IN cord-260054-iihgc5nr 93 7 Scotland Scotland NNP cord-260054-iihgc5nr 93 8 presented present VBD cord-260054-iihgc5nr 93 9 the the DT cord-260054-iihgc5nr 93 10 novel novel JJ cord-260054-iihgc5nr 93 11 S943I S943I NNP cord-260054-iihgc5nr 93 12 mutation mutation NN cord-260054-iihgc5nr 93 13 . . . 
cord-260054-iihgc5nr 94 1 As as IN cord-260054-iihgc5nr 94 2 for for IN cord-260054-iihgc5nr 94 3 the the DT cord-260054-iihgc5nr 94 4 remaining remain VBG cord-260054-iihgc5nr 94 5 positions position NNS cord-260054-iihgc5nr 94 6 of of IN cord-260054-iihgc5nr 94 7 the the DT cord-260054-iihgc5nr 94 8 HR1 HR1 NNP cord-260054-iihgc5nr 94 9 fusion fusion NN cord-260054-iihgc5nr 94 10 core core NN cord-260054-iihgc5nr 94 11 , , , cord-260054-iihgc5nr 94 12 to to IN cord-260054-iihgc5nr 94 13 May May NNP cord-260054-iihgc5nr 94 14 29 29 CD cord-260054-iihgc5nr 94 15 th th XX cord-260054-iihgc5nr 94 16 , , , cord-260054-iihgc5nr 94 17 either either CC cord-260054-iihgc5nr 94 18 they -PRON- PRP cord-260054-iihgc5nr 94 19 were be VBD cord-260054-iihgc5nr 94 20 fully fully RB cord-260054-iihgc5nr 94 21 conserved conserve VBN cord-260054-iihgc5nr 94 22 ( ( -LRB- cord-260054-iihgc5nr 94 23 S937 s937 CC cord-260054-iihgc5nr 94 24 , , , cord-260054-iihgc5nr 94 25 K933 K933 NNP cord-260054-iihgc5nr 94 26 , , , cord-260054-iihgc5nr 94 27 A942 A942 NNP cord-260054-iihgc5nr 94 28 , , , cord-260054-iihgc5nr 94 29 A944 A944 NNP cord-260054-iihgc5nr 94 30 , , , cord-260054-iihgc5nr 94 31 L945 L945 NNP cord-260054-iihgc5nr 94 32 , , , cord-260054-iihgc5nr 94 33 G946 G946 NNP cord-260054-iihgc5nr 94 34 , , , cord-260054-iihgc5nr 94 35 K947 K947 NNP cord-260054-iihgc5nr 94 36 and and CC cord-260054-iihgc5nr 94 37 Q949 Q949 NNP cord-260054-iihgc5nr 94 38 ) ) -RRB- cord-260054-iihgc5nr 94 39 , , , cord-260054-iihgc5nr 94 40 or or CC cord-260054-iihgc5nr 94 41 they -PRON- PRP cord-260054-iihgc5nr 94 42 hosted host VBD cord-260054-iihgc5nr 94 43 one one CD cord-260054-iihgc5nr 94 44 single single JJ cord-260054-iihgc5nr 94 45 occurrence occurrence NN cord-260054-iihgc5nr 94 46 of of IN cord-260054-iihgc5nr 94 47 mutation mutation NN cord-260054-iihgc5nr 94 48 ( ( -LRB- cord-260054-iihgc5nr 94 49 to to IN cord-260054-iihgc5nr 94 50 valine valine NN cord-260054-iihgc5nr 94 51 for for IN 
cord-260054-iihgc5nr 94 52 A930 A930 NNP cord-260054-iihgc5nr 94 53 , , , cord-260054-iihgc5nr 94 54 to to IN cord-260054-iihgc5nr 94 55 aspartate aspartate VB cord-260054-iihgc5nr 94 56 for for IN cord-260054-iihgc5nr 94 57 I931 I931 NNPS cord-260054-iihgc5nr 94 58 and and CC cord-260054-iihgc5nr 94 59 G932 G932 NNPS cord-260054-iihgc5nr 94 60 , , , cord-260054-iihgc5nr 94 61 to to TO cord-260054-iihgc5nr 94 62 histidine histidine VB cord-260054-iihgc5nr 94 63 for for IN cord-260054-iihgc5nr 94 64 Q934 Q934 NNP cord-260054-iihgc5nr 94 65 and and CC cord-260054-iihgc5nr 94 66 D935 D935 NNP cord-260054-iihgc5nr 94 67 , , , cord-260054-iihgc5nr 94 68 to to IN cord-260054-iihgc5nr 94 69 alanine alanine NN cord-260054-iihgc5nr 94 70 for for IN cord-260054-iihgc5nr 94 71 T941 T941 NNP cord-260054-iihgc5nr 94 72 and and CC cord-260054-iihgc5nr 94 73 to to IN cord-260054-iihgc5nr 94 74 arginine arginine NN cord-260054-iihgc5nr 94 75 for for IN cord-260054-iihgc5nr 94 76 L948 L948 NNP cord-260054-iihgc5nr 94 77 ) ) -RRB- cord-260054-iihgc5nr 94 78 . . . cord-260054-iihgc5nr 95 1 Because because IN cord-260054-iihgc5nr 95 2 of of IN cord-260054-iihgc5nr 95 3 the the DT cord-260054-iihgc5nr 95 4 rarity rarity NN cord-260054-iihgc5nr 95 5 of of IN cord-260054-iihgc5nr 95 6 such such JJ cord-260054-iihgc5nr 95 7 mutations mutation NNS cord-260054-iihgc5nr 95 8 , , , cord-260054-iihgc5nr 95 9 we -PRON- PRP cord-260054-iihgc5nr 95 10 will will MD cord-260054-iihgc5nr 95 11 not not RB cord-260054-iihgc5nr 95 12 discuss discuss VB cord-260054-iihgc5nr 95 13 them -PRON- PRP cord-260054-iihgc5nr 95 14 here here RB cord-260054-iihgc5nr 95 15 . . . 
cord-260054-iihgc5nr 96 1 However however RB cord-260054-iihgc5nr 96 2 , , , cord-260054-iihgc5nr 96 3 we -PRON- PRP cord-260054-iihgc5nr 96 4 will will MD cord-260054-iihgc5nr 96 5 continue continue VB cord-260054-iihgc5nr 96 6 to to TO cord-260054-iihgc5nr 96 7 monitor monitor VB cord-260054-iihgc5nr 96 8 them -PRON- PRP cord-260054-iihgc5nr 96 9 over over IN cord-260054-iihgc5nr 96 10 time time NN cord-260054-iihgc5nr 96 11 . . . cord-260054-iihgc5nr 97 1 All all PDT cord-260054-iihgc5nr 97 2 the the DT cord-260054-iihgc5nr 97 3 amino amino NN cord-260054-iihgc5nr 97 4 acids acid NNS cord-260054-iihgc5nr 97 5 undergoing undergo VBG cord-260054-iihgc5nr 98 1 mutations mutation NNS cord-260054-iihgc5nr 98 2 in in IN cord-260054-iihgc5nr 98 3 the the DT cord-260054-iihgc5nr 98 4 SARS SARS NNP cord-260054-iihgc5nr 98 5 - - HYPH cord-260054-iihgc5nr 98 6 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 98 7 S S NNP cord-260054-iihgc5nr 98 8 protein protein NN cord-260054-iihgc5nr 98 9 are be VBP cord-260054-iihgc5nr 98 10 conserved conserve VBN cord-260054-iihgc5nr 98 11 in in IN cord-260054-iihgc5nr 98 12 the the DT cord-260054-iihgc5nr 98 13 bat bat NN cord-260054-iihgc5nr 99 1 coronavirus coronavirus NN cord-260054-iihgc5nr 99 2 RaTG13 RaTG13 NNP cord-260054-iihgc5nr 99 3 S S NNP cord-260054-iihgc5nr 99 4 protein protein NN cord-260054-iihgc5nr 99 5 ( ( -LRB- cord-260054-iihgc5nr 99 6 sharing share VBG cord-260054-iihgc5nr 99 7 an an DT cord-260054-iihgc5nr 99 8 overall overall JJ cord-260054-iihgc5nr 99 9 sequence sequence NN cord-260054-iihgc5nr 99 10 identity identity NN cord-260054-iihgc5nr 99 11 of of IN cord-260054-iihgc5nr 99 12 97 97 CD cord-260054-iihgc5nr 99 13 % % NN cord-260054-iihgc5nr 99 14 with with IN cord-260054-iihgc5nr 99 15 SARS SARS NNP cord-260054-iihgc5nr 99 16 - - HYPH cord-260054-iihgc5nr 99 17 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 99 18 S S NNP cord-260054-iihgc5nr 99 19 protein protein NN cord-260054-iihgc5nr 99 20 ) ) -RRB- cord-260054-iihgc5nr 99 
21 , , , cord-260054-iihgc5nr 99 22 while while IN cord-260054-iihgc5nr 99 23 as as RB cord-260054-iihgc5nr 99 24 many many JJ cord-260054-iihgc5nr 99 25 as as IN cord-260054-iihgc5nr 99 26 five five CD cord-260054-iihgc5nr 99 27 of of IN cord-260054-iihgc5nr 99 28 them -PRON- PRP cord-260054-iihgc5nr 99 29 are be VBP cord-260054-iihgc5nr 99 30 mutated mutate VBN cord-260054-iihgc5nr 99 31 in in IN cord-260054-iihgc5nr 99 32 the the DT cord-260054-iihgc5nr 99 33 SARS SARS NNP cord-260054-iihgc5nr 99 34 - - HYPH cord-260054-iihgc5nr 99 35 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 99 36 S S NNP cord-260054-iihgc5nr 99 37 protein protein NN cord-260054-iihgc5nr 99 38 ( ( -LRB- cord-260054-iihgc5nr 99 39 overall overall JJ cord-260054-iihgc5nr 99 40 76 76 CD cord-260054-iihgc5nr 99 41 % % NN cord-260054-iihgc5nr 99 42 sequence sequence NN cord-260054-iihgc5nr 99 43 identical identical JJ cord-260054-iihgc5nr 99 44 to to IN cord-260054-iihgc5nr 99 45 the the DT cord-260054-iihgc5nr 99 46 SARS SARS NNP cord-260054-iihgc5nr 99 47 - - HYPH cord-260054-iihgc5nr 99 48 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 99 49 homolog homolog NN cord-260054-iihgc5nr 99 50 ) ) -RRB- cord-260054-iihgc5nr 100 1 ( ( -LRB- cord-260054-iihgc5nr 100 2 see see VB cord-260054-iihgc5nr 100 3 Figure figure NN cord-260054-iihgc5nr 100 4 1 1 CD cord-260054-iihgc5nr 100 5 ) ) -RRB- cord-260054-iihgc5nr 100 6 . . . 
cord-260054-iihgc5nr 101 1 Four four CD cord-260054-iihgc5nr 101 2 of of IN cord-260054-iihgc5nr 101 3 these these DT cord-260054-iihgc5nr 101 4 mutations mutation NNS cord-260054-iihgc5nr 101 5 are be VBP cord-260054-iihgc5nr 101 6 however however RB cord-260054-iihgc5nr 101 7 conservative conservative JJ cord-260054-iihgc5nr 101 8 ( ( -LRB- cord-260054-iihgc5nr 101 9 aspartate aspartate NN cord-260054-iihgc5nr 101 10 to to IN cord-260054-iihgc5nr 101 11 glutamate glutamate NN cord-260054-iihgc5nr 101 12 , , , cord-260054-iihgc5nr 101 13 serine serine NN cord-260054-iihgc5nr 101 14 to to IN cord-260054-iihgc5nr 101 15 threonine threonine NN cord-260054-iihgc5nr 101 16 ) ) -RRB- cord-260054-iihgc5nr 101 17 , , , cord-260054-iihgc5nr 101 18 except except IN cord-260054-iihgc5nr 101 19 S929 s929 NN cord-260054-iihgc5nr 101 20 , , , cord-260054-iihgc5nr 101 21 which which WDT cord-260054-iihgc5nr 101 22 is be VBZ cord-260054-iihgc5nr 101 23 a a DT cord-260054-iihgc5nr 101 24 lysine lysine NN cord-260054-iihgc5nr 101 25 in in IN cord-260054-iihgc5nr 101 26 SARS- SARS- NNP cord-260054-iihgc5nr 102 1 CoV. CoV. 
NNP cord-260054-iihgc5nr 103 1 It -PRON- PRP cord-260054-iihgc5nr 103 2 has have VBZ cord-260054-iihgc5nr 103 3 been be VBN cord-260054-iihgc5nr 103 4 proposed propose VBN cord-260054-iihgc5nr 103 5 that that IN cord-260054-iihgc5nr 103 6 such such JJ cord-260054-iihgc5nr 103 7 mutations mutation NNS cord-260054-iihgc5nr 103 8 in in IN cord-260054-iihgc5nr 103 9 the the DT cord-260054-iihgc5nr 103 10 SARS SARS NNP cord-260054-iihgc5nr 103 11 - - HYPH cord-260054-iihgc5nr 103 12 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 103 13 HR1 HR1 NNP cord-260054-iihgc5nr 103 14 may may MD cord-260054-iihgc5nr 103 15 be be VB cord-260054-iihgc5nr 103 16 associated associate VBN cord-260054-iihgc5nr 103 17 with with IN cord-260054-iihgc5nr 103 18 enhanced enhanced JJ cord-260054-iihgc5nr 103 19 interactions interaction NNS cord-260054-iihgc5nr 103 20 with with IN cord-260054-iihgc5nr 103 21 the the DT cord-260054-iihgc5nr 103 22 HR2 HR2 NNP cord-260054-iihgc5nr 103 23 , , , cord-260054-iihgc5nr 103 24 further further RB cord-260054-iihgc5nr 103 25 stabilizing stabilize VBG cord-260054-iihgc5nr 103 26 the the DT cord-260054-iihgc5nr 103 27 6-HB 6-hb CD cord-260054-iihgc5nr 103 28 structure structure NN cord-260054-iihgc5nr 103 29 and and CC cord-260054-iihgc5nr 103 30 maybe maybe RB cord-260054-iihgc5nr 103 31 leading lead VBG cord-260054-iihgc5nr 103 32 to to IN cord-260054-iihgc5nr 103 33 increased increase VBN cord-260054-iihgc5nr 103 34 infectivity infectivity NN cord-260054-iihgc5nr 103 35 of of IN cord-260054-iihgc5nr 103 36 the the DT cord-260054-iihgc5nr 103 37 virus virus NN cord-260054-iihgc5nr 103 38 ( ( -LRB- cord-260054-iihgc5nr 103 39 29 29 CD cord-260054-iihgc5nr 103 40 ) ) -RRB- cord-260054-iihgc5nr 103 41 . . . 
cord-260054-iihgc5nr 104 1 It -PRON- PRP cord-260054-iihgc5nr 104 2 is be VBZ cord-260054-iihgc5nr 104 3 noteworthy noteworthy JJ cord-260054-iihgc5nr 104 4 that that IN cord-260054-iihgc5nr 104 5 no no DT cord-260054-iihgc5nr 104 6 one one NN cord-260054-iihgc5nr 104 7 of of IN cord-260054-iihgc5nr 104 8 the the DT cord-260054-iihgc5nr 104 9 point point NN cord-260054-iihgc5nr 104 10 mutations mutation NNS cord-260054-iihgc5nr 104 11 we -PRON- PRP cord-260054-iihgc5nr 104 12 identified identify VBD cord-260054-iihgc5nr 104 13 restored restore VBD cord-260054-iihgc5nr 104 14 the the DT cord-260054-iihgc5nr 104 15 corresponding correspond VBG cord-260054-iihgc5nr 104 16 SARS SARS NNP cord-260054-iihgc5nr 104 17 - - HYPH cord-260054-iihgc5nr 104 18 CoV cov NN cord-260054-iihgc5nr 104 19 amino amino NN cord-260054-iihgc5nr 104 20 acid acid NN cord-260054-iihgc5nr 104 21 . . . cord-260054-iihgc5nr 105 1 In in IN cord-260054-iihgc5nr 105 2 the the DT cord-260054-iihgc5nr 105 3 pre pre JJ cord-260054-iihgc5nr 105 4 - - JJ cord-260054-iihgc5nr 105 5 fusion fusion JJ cord-260054-iihgc5nr 105 6 conformation conformation NN cord-260054-iihgc5nr 105 7 , , , cord-260054-iihgc5nr 105 8 all all PDT cord-260054-iihgc5nr 105 9 the the DT cord-260054-iihgc5nr 105 10 mutated mutate VBN cord-260054-iihgc5nr 105 11 positions position NNS cord-260054-iihgc5nr 105 12 , , , cord-260054-iihgc5nr 105 13 but but CC cord-260054-iihgc5nr 105 14 S943 S943 NNP cord-260054-iihgc5nr 105 15 , , , cord-260054-iihgc5nr 105 16 are be VBP cord-260054-iihgc5nr 105 17 located locate VBN cord-260054-iihgc5nr 105 18 on on IN cord-260054-iihgc5nr 105 19 the the DT cord-260054-iihgc5nr 105 20 second second NN cord-260054-iihgc5nr 105 21 of of IN cord-260054-iihgc5nr 105 22 four four CD cord-260054-iihgc5nr 105 23 non non JJ cord-260054-iihgc5nr 105 24 - - JJ cord-260054-iihgc5nr 105 25 coaxial coaxial JJ cord-260054-iihgc5nr 105 26 helical helical JJ cord-260054-iihgc5nr 105 27 segments segment NNS 
cord-260054-iihgc5nr 105 28 composing compose VBG cord-260054-iihgc5nr 105 29 the the DT cord-260054-iihgc5nr 105 30 HR1 HR1 NNP cord-260054-iihgc5nr 105 31 ( ( -LRB- cord-260054-iihgc5nr 105 32 Figure figure NN cord-260054-iihgc5nr 105 33 1 1 CD cord-260054-iihgc5nr 105 34 ) ) -RRB- cord-260054-iihgc5nr 105 35 . . . cord-260054-iihgc5nr 106 1 Four four CD cord-260054-iihgc5nr 106 2 of of IN cord-260054-iihgc5nr 106 3 them -PRON- PRP cord-260054-iihgc5nr 106 4 , , , cord-260054-iihgc5nr 106 5 S929 s929 JJ cord-260054-iihgc5nr 106 6 , , , cord-260054-iihgc5nr 106 7 D936 D936 NNPS cord-260054-iihgc5nr 106 8 , , , cord-260054-iihgc5nr 106 9 S939 S939 NNP cord-260054-iihgc5nr 106 10 and and CC cord-260054-iihgc5nr 106 11 S940 S940 NNP cord-260054-iihgc5nr 106 12 , , , cord-260054-iihgc5nr 106 13 are be VBP cord-260054-iihgc5nr 106 14 exposed expose VBN cord-260054-iihgc5nr 106 15 to to IN cord-260054-iihgc5nr 106 16 the the DT cord-260054-iihgc5nr 106 17 solvent solvent NN cord-260054-iihgc5nr 106 18 ( ( -LRB- cord-260054-iihgc5nr 106 19 Table Table NNP cord-260054-iihgc5nr 106 20 2 2 CD cord-260054-iihgc5nr 106 21 ) ) -RRB- cord-260054-iihgc5nr 106 22 , , , cord-260054-iihgc5nr 106 23 and and CC cord-260054-iihgc5nr 106 24 can can MD cord-260054-iihgc5nr 106 25 be be VB cord-260054-iihgc5nr 106 26 modelled model VBN cord-260054-iihgc5nr 106 27 as as IN cord-260054-iihgc5nr 106 28 larger large JJR cord-260054-iihgc5nr 106 29 ( ( -LRB- cord-260054-iihgc5nr 106 30 mostly mostly RB cord-260054-iihgc5nr 106 31 aromatic aromatic JJ cord-260054-iihgc5nr 106 32 ) ) -RRB- cord-260054-iihgc5nr 106 33 residues residue NNS cord-260054-iihgc5nr 106 34 without without IN cord-260054-iihgc5nr 106 35 causing cause VBG cord-260054-iihgc5nr 106 36 any any DT cord-260054-iihgc5nr 106 37 structural structural JJ cord-260054-iihgc5nr 106 38 strain strain NN cord-260054-iihgc5nr 106 39 ( ( -LRB- cord-260054-iihgc5nr 106 40 see see VB cord-260054-iihgc5nr 106 41 Figure figure NN 
cord-260054-iihgc5nr 106 42 2 2 CD cord-260054-iihgc5nr 106 43 ) ) -RRB- cord-260054-iihgc5nr 106 44 . . . cord-260054-iihgc5nr 107 1 These these DT cord-260054-iihgc5nr 107 2 mutations mutation NNS cord-260054-iihgc5nr 107 3 are be VBP cord-260054-iihgc5nr 107 4 not not RB cord-260054-iihgc5nr 107 5 expected expect VBN cord-260054-iihgc5nr 107 6 to to TO cord-260054-iihgc5nr 107 7 cause cause VB cord-260054-iihgc5nr 107 8 relevant relevant JJ cord-260054-iihgc5nr 107 9 changes change NNS cord-260054-iihgc5nr 107 10 in in IN cord-260054-iihgc5nr 107 11 the the DT cord-260054-iihgc5nr 107 12 prefusion prefusion NN cord-260054-iihgc5nr 107 13 structure structure NN cord-260054-iihgc5nr 107 14 , , , cord-260054-iihgc5nr 107 15 although although IN cord-260054-iihgc5nr 107 16 they -PRON- PRP cord-260054-iihgc5nr 107 17 could could MD cord-260054-iihgc5nr 107 18 have have VB cord-260054-iihgc5nr 107 19 a a DT cord-260054-iihgc5nr 107 20 destabilizing destabilizing JJ cord-260054-iihgc5nr 107 21 effect effect NN cord-260054-iihgc5nr 107 22 as as IN cord-260054-iihgc5nr 107 23 a a DT cord-260054-iihgc5nr 107 24 consequence consequence NN cord-260054-iihgc5nr 107 25 of of IN cord-260054-iihgc5nr 107 26 posing pose VBG cord-260054-iihgc5nr 107 27 large large JJ cord-260054-iihgc5nr 107 28 aromatic aromatic JJ cord-260054-iihgc5nr 107 29 residues residue NNS cord-260054-iihgc5nr 107 30 in in IN cord-260054-iihgc5nr 107 31 direct direct JJ cord-260054-iihgc5nr 107 32 contact contact NN cord-260054-iihgc5nr 107 33 with with IN cord-260054-iihgc5nr 107 34 the the DT cord-260054-iihgc5nr 107 35 solvent solvent NN cord-260054-iihgc5nr 107 36 instead instead RB cord-260054-iihgc5nr 107 37 of of IN cord-260054-iihgc5nr 107 38 smaller small JJR cord-260054-iihgc5nr 107 39 apolar apolar NNP cord-260054-iihgc5nr 107 40 ( ( -LRB- cord-260054-iihgc5nr 107 41 leucine leucine NNP cord-260054-iihgc5nr 107 42 ) ) -RRB- cord-260054-iihgc5nr 107 43 , , , cord-260054-iihgc5nr 107 44 polar 
polar JJ cord-260054-iihgc5nr 107 45 ( ( -LRB- cord-260054-iihgc5nr 107 46 serine serine NN cord-260054-iihgc5nr 107 47 in in IN cord-260054-iihgc5nr 107 48 2 2 CD cord-260054-iihgc5nr 107 49 cases case NNS cord-260054-iihgc5nr 107 50 ) ) -RRB- cord-260054-iihgc5nr 107 51 or or CC cord-260054-iihgc5nr 107 52 even even RB cord-260054-iihgc5nr 107 53 charged charge VBN cord-260054-iihgc5nr 107 54 ( ( -LRB- cord-260054-iihgc5nr 107 55 aspartate aspartate NN cord-260054-iihgc5nr 107 56 ) ) -RRB- cord-260054-iihgc5nr 107 57 residues residue NNS cord-260054-iihgc5nr 107 58 . . . cord-260054-iihgc5nr 108 1 In in IN cord-260054-iihgc5nr 108 2 addition addition NN cord-260054-iihgc5nr 108 3 , , , cord-260054-iihgc5nr 108 4 S940 S940 NNP cord-260054-iihgc5nr 108 5 involves involve VBZ cord-260054-iihgc5nr 108 6 its -PRON- PRP$ cord-260054-iihgc5nr 108 7 side side NN cord-260054-iihgc5nr 108 8 - - HYPH cord-260054-iihgc5nr 108 9 chain chain NN cord-260054-iihgc5nr 108 10 in in IN cord-260054-iihgc5nr 108 11 a a DT cord-260054-iihgc5nr 108 12 H h NN cord-260054-iihgc5nr 108 13 - - HYPH cord-260054-iihgc5nr 108 14 bond bond NN cord-260054-iihgc5nr 108 15 with with IN cord-260054-iihgc5nr 108 16 the the DT cord-260054-iihgc5nr 108 17 main main JJ cord-260054-iihgc5nr 108 18 - - HYPH cord-260054-iihgc5nr 108 19 chain chain NN cord-260054-iihgc5nr 108 20 of of IN cord-260054-iihgc5nr 108 21 D936 D936 NNS cord-260054-iihgc5nr 108 22 , , , cord-260054-iihgc5nr 108 23 4 4 CD cord-260054-iihgc5nr 108 24 residues residue NNS cord-260054-iihgc5nr 108 25 upstream upstream RB cord-260054-iihgc5nr 108 26 . . . 
cord-260054-iihgc5nr 109 1 The the DT cord-260054-iihgc5nr 109 2 loss loss NN cord-260054-iihgc5nr 109 3 of of IN cord-260054-iihgc5nr 109 4 this this DT cord-260054-iihgc5nr 109 5 H h NN cord-260054-iihgc5nr 109 6 - - HYPH cord-260054-iihgc5nr 109 7 bond bond NN cord-260054-iihgc5nr 109 8 in in IN cord-260054-iihgc5nr 109 9 the the DT cord-260054-iihgc5nr 109 10 S940F S940F NNPS cord-260054-iihgc5nr 109 11 mutant mutant NN cord-260054-iihgc5nr 109 12 also also RB cord-260054-iihgc5nr 109 13 points point VBZ cord-260054-iihgc5nr 109 14 to to IN cord-260054-iihgc5nr 109 15 a a DT cord-260054-iihgc5nr 109 16 slight slight JJ cord-260054-iihgc5nr 109 17 destabilization destabilization NN cord-260054-iihgc5nr 109 18 of of IN cord-260054-iihgc5nr 109 19 the the DT cord-260054-iihgc5nr 109 20 pre pre JJ cord-260054-iihgc5nr 109 21 - - JJ cord-260054-iihgc5nr 109 22 fusion fusion JJ cord-260054-iihgc5nr 109 23 conformation conformation NN cord-260054-iihgc5nr 109 24 . . . cord-260054-iihgc5nr 110 1 As as IN cord-260054-iihgc5nr 110 2 for for IN cord-260054-iihgc5nr 110 3 L938 L938 NNP cord-260054-iihgc5nr 110 4 , , , cord-260054-iihgc5nr 110 5 it -PRON- PRP cord-260054-iihgc5nr 110 6 is be VBZ cord-260054-iihgc5nr 110 7 buried bury VBN cord-260054-iihgc5nr 110 8 in in IN cord-260054-iihgc5nr 110 9 the the DT cord-260054-iihgc5nr 110 10 prefusion prefusion NN cord-260054-iihgc5nr 110 11 conformation conformation NN cord-260054-iihgc5nr 110 12 , , , cord-260054-iihgc5nr 110 13 pointing point VBG cord-260054-iihgc5nr 110 14 towards towards IN cord-260054-iihgc5nr 110 15 a a DT cord-260054-iihgc5nr 110 16 three three CD cord-260054-iihgc5nr 110 17 - - HYPH cord-260054-iihgc5nr 110 18 stranded strand VBN cord-260054-iihgc5nr 110 19 anti anti JJ cord-260054-iihgc5nr 110 20 - - JJ cord-260054-iihgc5nr 110 21 parallel parallel JJ cord-260054-iihgc5nr 110 22 β β NN cord-260054-iihgc5nr 110 23 - - HYPH cord-260054-iihgc5nr 110 24 sheet sheet NN cord-260054-iihgc5nr 110 25 made make 
VBN cord-260054-iihgc5nr 110 26 of of IN cord-260054-iihgc5nr 110 27 the the DT cord-260054-iihgc5nr 110 28 S711-P728 S711-P728 NNP cord-260054-iihgc5nr 110 29 segment segment NN cord-260054-iihgc5nr 110 30 from from IN cord-260054-iihgc5nr 110 31 the the DT cord-260054-iihgc5nr 110 32 S1 S1 NNP cord-260054-iihgc5nr 110 33 subunit subunit NN cord-260054-iihgc5nr 110 34 and and CC cord-260054-iihgc5nr 110 35 of of IN cord-260054-iihgc5nr 110 36 the the DT cord-260054-iihgc5nr 110 37 Y1047-P1053 y1047-p1053 NN cord-260054-iihgc5nr 110 38 and and CC cord-260054-iihgc5nr 110 39 G1059-A1078 G1059-A1078 NNP cord-260054-iihgc5nr 110 40 segments segment NNS cord-260054-iihgc5nr 110 41 from from IN cord-260054-iihgc5nr 110 42 the the DT cord-260054-iihgc5nr 110 43 S2 S2 NNP cord-260054-iihgc5nr 110 44 subunit subunit NN cord-260054-iihgc5nr 110 45 , , , cord-260054-iihgc5nr 110 46 without without IN cord-260054-iihgc5nr 110 47 directly directly RB cord-260054-iihgc5nr 110 48 contacting contact VBG cord-260054-iihgc5nr 110 49 it -PRON- PRP cord-260054-iihgc5nr 111 1 ( ( -LRB- cord-260054-iihgc5nr 111 2 distances distance NNS cord-260054-iihgc5nr 111 3 above above IN cord-260054-iihgc5nr 111 4 5 5 CD cord-260054-iihgc5nr 111 5 Å å NN cord-260054-iihgc5nr 111 6 ) ) -RRB- cord-260054-iihgc5nr 111 7 . . . cord-260054-iihgc5nr 112 1 It -PRON- PRP cord-260054-iihgc5nr 112 2 can can MD cord-260054-iihgc5nr 112 3 also also RB cord-260054-iihgc5nr 112 4 be be VB cord-260054-iihgc5nr 112 5 modelled model VBN cord-260054-iihgc5nr 112 6 as as IN cord-260054-iihgc5nr 112 7 a a DT cord-260054-iihgc5nr 112 8 large large JJ cord-260054-iihgc5nr 112 9 phenylalanine phenylalanine NN cord-260054-iihgc5nr 112 10 without without IN cord-260054-iihgc5nr 112 11 causing cause VBG cord-260054-iihgc5nr 112 12 sterical sterical JJ cord-260054-iihgc5nr 112 13 strain strain NN cord-260054-iihgc5nr 112 14 . . . 
cord-260054-iihgc5nr 113 1 Upon upon IN cord-260054-iihgc5nr 113 2 mutation mutation NN cord-260054-iihgc5nr 113 3 , , , cord-260054-iihgc5nr 113 4 it -PRON- PRP cord-260054-iihgc5nr 113 5 seems seem VBZ cord-260054-iihgc5nr 113 6 to to TO cord-260054-iihgc5nr 113 7 optimize optimize VB cord-260054-iihgc5nr 113 8 the the DT cord-260054-iihgc5nr 113 9 hydrophobic hydrophobic JJ cord-260054-iihgc5nr 113 10 interactions interaction NNS cord-260054-iihgc5nr 113 11 with with IN cord-260054-iihgc5nr 113 12 the the DT cord-260054-iihgc5nr 113 13 neighboring neighboring NN cord-260054-iihgc5nr 113 14 residues residue NNS cord-260054-iihgc5nr 113 15 , , , cord-260054-iihgc5nr 113 16 especially especially RB cord-260054-iihgc5nr 113 17 I726 I726 NNP cord-260054-iihgc5nr 113 18 and and CC cord-260054-iihgc5nr 113 19 A944 A944 NNP cord-260054-iihgc5nr 113 20 . . . cord-260054-iihgc5nr 114 1 Finally finally RB cord-260054-iihgc5nr 114 2 , , , cord-260054-iihgc5nr 114 3 S943 S943 NNP cord-260054-iihgc5nr 114 4 is be VBZ cord-260054-iihgc5nr 114 5 located locate VBN cord-260054-iihgc5nr 114 6 on on IN cord-260054-iihgc5nr 114 7 a a DT cord-260054-iihgc5nr 114 8 turn turn NN cord-260054-iihgc5nr 114 9 immediately immediately RB cord-260054-iihgc5nr 114 10 downstream downstream VBP cord-260054-iihgc5nr 114 11 the the DT cord-260054-iihgc5nr 114 12 helical helical JJ cord-260054-iihgc5nr 114 13 segment segment NN cord-260054-iihgc5nr 114 14 hosting host VBG cord-260054-iihgc5nr 114 15 the the DT cord-260054-iihgc5nr 114 16 above above JJ cord-260054-iihgc5nr 114 17 five five CD cord-260054-iihgc5nr 114 18 mutations mutation NNS cord-260054-iihgc5nr 114 19 , , , cord-260054-iihgc5nr 114 20 between between IN cord-260054-iihgc5nr 114 21 the the DT cord-260054-iihgc5nr 114 22 second second JJ cord-260054-iihgc5nr 114 23 and and CC cord-260054-iihgc5nr 114 24 third third JJ cord-260054-iihgc5nr 114 25 helical helical JJ cord-260054-iihgc5nr 114 26 segments segment NNS 
cord-260054-iihgc5nr 114 27 . . . cord-260054-iihgc5nr 115 1 The the DT cord-260054-iihgc5nr 115 2 wild wild JJ cord-260054-iihgc5nr 115 3 - - HYPH cord-260054-iihgc5nr 115 4 type type NN cord-260054-iihgc5nr 115 5 residue residue NN cord-260054-iihgc5nr 115 6 S943 S943 NNP cord-260054-iihgc5nr 115 7 features feature VBZ cord-260054-iihgc5nr 115 8 φ φ NN cord-260054-iihgc5nr 115 9 and and CC cord-260054-iihgc5nr 115 10 ψ ψ LS cord-260054-iihgc5nr 115 11 dihedral dihedral JJ cord-260054-iihgc5nr 115 12 angles angle NNS cord-260054-iihgc5nr 115 13 of of IN cord-260054-iihgc5nr 115 14 58.5 58.5 CD cord-260054-iihgc5nr 115 15 ° ° NNS cord-260054-iihgc5nr 115 16 and and CC cord-260054-iihgc5nr 115 17 24.5 24.5 CD cord-260054-iihgc5nr 115 18 ° ° NNS cord-260054-iihgc5nr 115 19 , , , cord-260054-iihgc5nr 115 20 respectively respectively RB cord-260054-iihgc5nr 115 21 , , , cord-260054-iihgc5nr 115 22 which which WDT cord-260054-iihgc5nr 115 23 fall fall VBP cord-260054-iihgc5nr 115 24 in in IN cord-260054-iihgc5nr 115 25 an an DT cord-260054-iihgc5nr 115 26 unfavourable unfavourable JJ cord-260054-iihgc5nr 115 27 region region NN cord-260054-iihgc5nr 115 28 for for IN cord-260054-iihgc5nr 115 29 prolines proline NNS cord-260054-iihgc5nr 115 30 . . . 
cord-260054-iihgc5nr 116 1 In in IN cord-260054-iihgc5nr 116 2 the the DT cord-260054-iihgc5nr 116 3 S943P S943P NNP cord-260054-iihgc5nr 116 4 model model NN cord-260054-iihgc5nr 116 5 we -PRON- PRP cord-260054-iihgc5nr 116 6 generated generate VBD cord-260054-iihgc5nr 116 7 , , , cord-260054-iihgc5nr 116 8 the the DT cord-260054-iihgc5nr 116 9 P943 P943 NNP cord-260054-iihgc5nr 116 10 φ φ NN cord-260054-iihgc5nr 116 11 and and CC cord-260054-iihgc5nr 116 12 ψ ψ NN cord-260054-iihgc5nr 116 13 dihedrals dihedral NNS cord-260054-iihgc5nr 116 14 assume assume VBP cord-260054-iihgc5nr 116 15 the the DT cord-260054-iihgc5nr 116 16 values value NNS cord-260054-iihgc5nr 116 17 of of IN cord-260054-iihgc5nr 116 18 3.0 3.0 CD cord-260054-iihgc5nr 116 19 ° ° NNS cord-260054-iihgc5nr 116 20 and and CC cord-260054-iihgc5nr 116 21 68.2 68.2 CD cord-260054-iihgc5nr 116 22 ° ° NNS cord-260054-iihgc5nr 116 23 , , , cord-260054-iihgc5nr 116 24 placing place VBG cord-260054-iihgc5nr 116 25 the the DT cord-260054-iihgc5nr 116 26 residue residue NN cord-260054-iihgc5nr 116 27 in in IN cord-260054-iihgc5nr 116 28 an an DT cord-260054-iihgc5nr 116 29 outlier outlier JJ cord-260054-iihgc5nr 116 30 region region NN cord-260054-iihgc5nr 116 31 ( ( -LRB- cord-260054-iihgc5nr 116 32 39 39 CD cord-260054-iihgc5nr 116 33 ) ) -RRB- cord-260054-iihgc5nr 116 34 . . . 
cord-260054-iihgc5nr 117 1 The the DT cord-260054-iihgc5nr 117 2 favoured favoured JJ cord-260054-iihgc5nr 117 3 φ φ NN cord-260054-iihgc5nr 117 4 angle angle NN cord-260054-iihgc5nr 117 5 for for IN cord-260054-iihgc5nr 117 6 prolines proline NNS cord-260054-iihgc5nr 117 7 is be VBZ cord-260054-iihgc5nr 117 8 indeed indeed RB cord-260054-iihgc5nr 117 9 restricted restricted JJ cord-260054-iihgc5nr 117 10 to to IN cord-260054-iihgc5nr 117 11 the the DT cord-260054-iihgc5nr 117 12 value value NN cord-260054-iihgc5nr 117 13 of of IN cord-260054-iihgc5nr 117 14 -63 -63 NNP cord-260054-iihgc5nr 117 15 ± ± NNP cord-260054-iihgc5nr 117 16 15 15 CD cord-260054-iihgc5nr 117 17 ° ° NNS cord-260054-iihgc5nr 117 18 , , , cord-260054-iihgc5nr 117 19 ( ( -LRB- cord-260054-iihgc5nr 117 20 40 40 CD cord-260054-iihgc5nr 117 21 ) ) -RRB- cord-260054-iihgc5nr 117 22 characteristic characteristic NN cord-260054-iihgc5nr 117 23 of of IN cord-260054-iihgc5nr 117 24 α α NN cord-260054-iihgc5nr 117 25 - - HYPH cord-260054-iihgc5nr 117 26 helices helix NNS cord-260054-iihgc5nr 117 27 . . . 
cord-260054-iihgc5nr 118 1 A a DT cord-260054-iihgc5nr 118 2 proline proline NN cord-260054-iihgc5nr 118 3 at at IN cord-260054-iihgc5nr 118 4 such such PDT cord-260054-iihgc5nr 118 5 a a DT cord-260054-iihgc5nr 118 6 position position NN cord-260054-iihgc5nr 118 7 would would MD cord-260054-iihgc5nr 118 8 therefore therefore RB cord-260054-iihgc5nr 118 9 introduce introduce VB cord-260054-iihgc5nr 118 10 an an DT cord-260054-iihgc5nr 118 11 anomaly anomaly NN cord-260054-iihgc5nr 118 12 in in IN cord-260054-iihgc5nr 118 13 the the DT cord-260054-iihgc5nr 118 14 pre pre JJ cord-260054-iihgc5nr 118 15 - - JJ cord-260054-iihgc5nr 118 16 fusion fusion JJ cord-260054-iihgc5nr 118 17 conformation conformation NN cord-260054-iihgc5nr 118 18 , , , cord-260054-iihgc5nr 118 19 while while IN cord-260054-iihgc5nr 118 20 strongly strongly RB cord-260054-iihgc5nr 118 21 promoting promote VBG cord-260054-iihgc5nr 118 22 the the DT cord-260054-iihgc5nr 118 23 transition transition NN cord-260054-iihgc5nr 118 24 to to IN cord-260054-iihgc5nr 118 25 the the DT cord-260054-iihgc5nr 118 26 post post JJ cord-260054-iihgc5nr 118 27 - - JJ cord-260054-iihgc5nr 118 28 fusion fusion JJ cord-260054-iihgc5nr 118 29 single single JJ cord-260054-iihgc5nr 118 30 continuous continuous JJ cord-260054-iihgc5nr 118 31 helical helical JJ cord-260054-iihgc5nr 118 32 conformation conformation NN cord-260054-iihgc5nr 118 33 . . . 
cord-260054-iihgc5nr 119 1 It -PRON- PRP cord-260054-iihgc5nr 119 2 is be VBZ cord-260054-iihgc5nr 119 3 also also RB cord-260054-iihgc5nr 119 4 worth worth JJ cord-260054-iihgc5nr 119 5 noticing notice VBG cord-260054-iihgc5nr 119 6 that that IN cord-260054-iihgc5nr 119 7 this this DT cord-260054-iihgc5nr 119 8 would would MD cord-260054-iihgc5nr 119 9 be be VB cord-260054-iihgc5nr 119 10 the the DT cord-260054-iihgc5nr 119 11 only only JJ cord-260054-iihgc5nr 119 12 mutation mutation NN cord-260054-iihgc5nr 119 13 among among IN cord-260054-iihgc5nr 119 14 those those DT cord-260054-iihgc5nr 119 15 we -PRON- PRP cord-260054-iihgc5nr 119 16 identified identify VBD cord-260054-iihgc5nr 119 17 so so RB cord-260054-iihgc5nr 119 18 far far RB cord-260054-iihgc5nr 119 19 , , , cord-260054-iihgc5nr 119 20 introducing introduce VBG cord-260054-iihgc5nr 119 21 a a DT cord-260054-iihgc5nr 119 22 proline proline NN cord-260054-iihgc5nr 119 23 residue residue NN cord-260054-iihgc5nr 119 24 in in IN cord-260054-iihgc5nr 119 25 the the DT cord-260054-iihgc5nr 119 26 SARS SARS NNP cord-260054-iihgc5nr 119 27 - - HYPH cord-260054-iihgc5nr 119 28 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 119 29 S S NNP cord-260054-iihgc5nr 119 30 protein protein NN cord-260054-iihgc5nr 119 31 ( ( -LRB- cord-260054-iihgc5nr 119 32 Table Table NNP cord-260054-iihgc5nr 119 33 S1 S1 NNP cord-260054-iihgc5nr 119 34 ) ) -RRB- cord-260054-iihgc5nr 119 35 . . . 
cord-260054-iihgc5nr 120 1 In in IN cord-260054-iihgc5nr 120 2 light light NN cord-260054-iihgc5nr 120 3 of of IN cord-260054-iihgc5nr 120 4 the the DT cord-260054-iihgc5nr 120 5 analysis analysis NN cord-260054-iihgc5nr 120 6 of of IN cord-260054-iihgc5nr 120 7 the the DT cord-260054-iihgc5nr 120 8 GISAID GISAID NNP cord-260054-iihgc5nr 121 1 May May NNP cord-260054-iihgc5nr 121 2 29 29 CD cord-260054-iihgc5nr 121 3 th th XX cord-260054-iihgc5nr 121 4 updated update VBN cord-260054-iihgc5nr 121 5 , , , cord-260054-iihgc5nr 121 6 we -PRON- PRP cord-260054-iihgc5nr 121 7 also also RB cord-260054-iihgc5nr 121 8 modelled model VBD cord-260054-iihgc5nr 121 9 the the DT cord-260054-iihgc5nr 121 10 S943I S943I NNP cord-260054-iihgc5nr 121 11 mutation mutation NN cord-260054-iihgc5nr 121 12 . . . cord-260054-iihgc5nr 122 1 Being be VBG cord-260054-iihgc5nr 122 2 isoleucine isoleucine NN cord-260054-iihgc5nr 122 3 compatible compatible JJ cord-260054-iihgc5nr 122 4 with with IN cord-260054-iihgc5nr 122 5 the the DT cord-260054-iihgc5nr 122 6 S943 s943 JJ cord-260054-iihgc5nr 122 7 dihedral dihedral JJ cord-260054-iihgc5nr 122 8 values value NNS cord-260054-iihgc5nr 122 9 , , , cord-260054-iihgc5nr 122 10 this this DT cord-260054-iihgc5nr 122 11 mutation mutation NN cord-260054-iihgc5nr 122 12 does do VBZ cord-260054-iihgc5nr 122 13 not not RB cord-260054-iihgc5nr 122 14 result result VB cord-260054-iihgc5nr 122 15 in in IN cord-260054-iihgc5nr 122 16 any any DT cord-260054-iihgc5nr 122 17 structural structural JJ cord-260054-iihgc5nr 122 18 strain strain NN cord-260054-iihgc5nr 122 19 . . . 
cord-260054-iihgc5nr 123 1 When when WRB cord-260054-iihgc5nr 123 2 looking look VBG cord-260054-iihgc5nr 123 3 at at IN cord-260054-iihgc5nr 123 4 the the DT cord-260054-iihgc5nr 123 5 post post JJ cord-260054-iihgc5nr 123 6 - - JJ cord-260054-iihgc5nr 123 7 fusion fusion JJ cord-260054-iihgc5nr 123 8 conformation conformation NN cord-260054-iihgc5nr 123 9 of of IN cord-260054-iihgc5nr 123 10 the the DT cord-260054-iihgc5nr 123 11 SARS SARS NNP cord-260054-iihgc5nr 123 12 - - HYPH cord-260054-iihgc5nr 123 13 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 123 14 spike spike NN cord-260054-iihgc5nr 123 15 protein protein NN cord-260054-iihgc5nr 123 16 S2 s2 NN cord-260054-iihgc5nr 123 17 subunit subunit NN cord-260054-iihgc5nr 123 18 , , , cord-260054-iihgc5nr 123 19 these these DT cord-260054-iihgc5nr 123 20 mutations mutation NNS cord-260054-iihgc5nr 123 21 appear appear VBP cord-260054-iihgc5nr 123 22 more more RBR cord-260054-iihgc5nr 123 23 revealing revealing JJ cord-260054-iihgc5nr 123 24 . . . 
cord-260054-iihgc5nr 124 1 Three three CD cord-260054-iihgc5nr 124 2 of of IN cord-260054-iihgc5nr 124 3 the the DT cord-260054-iihgc5nr 124 4 wild wild JJ cord-260054-iihgc5nr 124 5 - - HYPH cord-260054-iihgc5nr 124 6 type type NN cord-260054-iihgc5nr 124 7 residues residue NNS cord-260054-iihgc5nr 124 8 , , , cord-260054-iihgc5nr 124 9 S929 s929 JJ cord-260054-iihgc5nr 124 10 , , , cord-260054-iihgc5nr 124 11 D936 D936 VBZ cord-260054-iihgc5nr 124 12 and and CC cord-260054-iihgc5nr 124 13 S943 S943 NNP cord-260054-iihgc5nr 124 14 , , , cord-260054-iihgc5nr 124 15 are be VBP cord-260054-iihgc5nr 124 16 indeed indeed RB cord-260054-iihgc5nr 124 17 engaged engage VBN cord-260054-iihgc5nr 124 18 in in IN cord-260054-iihgc5nr 124 19 side side NN cord-260054-iihgc5nr 124 20 - - HYPH cord-260054-iihgc5nr 124 21 chain chain NN cord-260054-iihgc5nr 124 22 to to IN cord-260054-iihgc5nr 124 23 side side NN cord-260054-iihgc5nr 124 24 - - HYPH cord-260054-iihgc5nr 124 25 chain chain NN cord-260054-iihgc5nr 124 26 H h NN cord-260054-iihgc5nr 124 27 - - HYPH cord-260054-iihgc5nr 124 28 bonds bond NNS cord-260054-iihgc5nr 124 29 with with IN cord-260054-iihgc5nr 124 30 the the DT cord-260054-iihgc5nr 124 31 HR2 HR2 NNP cord-260054-iihgc5nr 124 32 segment segment NN cord-260054-iihgc5nr 124 33 of of IN cord-260054-iihgc5nr 124 34 an an DT cord-260054-iihgc5nr 124 35 adjacent adjacent JJ cord-260054-iihgc5nr 124 36 monomer monomer NN cord-260054-iihgc5nr 124 37 . . . 
cord-260054-iihgc5nr 125 1 In in IN cord-260054-iihgc5nr 125 2 particular particular JJ cord-260054-iihgc5nr 125 3 , , , cord-260054-iihgc5nr 125 4 S929 s929 JJ cord-260054-iihgc5nr 125 5 , , , cord-260054-iihgc5nr 125 6 D936 D936 NNS cord-260054-iihgc5nr 125 7 and and CC cord-260054-iihgc5nr 125 8 S943 s943 JJ cord-260054-iihgc5nr 125 9 ( ( -LRB- cord-260054-iihgc5nr 125 10 HR1 HR1 NNP cord-260054-iihgc5nr 125 11 on on IN cord-260054-iihgc5nr 125 12 Chain Chain NNP cord-260054-iihgc5nr 125 13 A A NNP cord-260054-iihgc5nr 125 14 ) ) -RRB- cord-260054-iihgc5nr 125 15 are be VBP cord-260054-iihgc5nr 125 16 H h NN cord-260054-iihgc5nr 125 17 - - HYPH cord-260054-iihgc5nr 125 18 bonded bond VBN cord-260054-iihgc5nr 125 19 to to IN cord-260054-iihgc5nr 125 20 S1196 S1196 NNP cord-260054-iihgc5nr 125 21 , , , cord-260054-iihgc5nr 125 22 R1185 R1185 NNP cord-260054-iihgc5nr 125 23 and and CC cord-260054-iihgc5nr 125 24 E1182 E1182 NNP cord-260054-iihgc5nr 125 25 , , , cord-260054-iihgc5nr 125 26 respectively respectively RB cord-260054-iihgc5nr 125 27 ( ( -LRB- cord-260054-iihgc5nr 125 28 HR2 HR2 NNP cord-260054-iihgc5nr 125 29 on on IN cord-260054-iihgc5nr 125 30 Chain Chain NNP cord-260054-iihgc5nr 125 31 C C NNP cord-260054-iihgc5nr 125 32 , , , cord-260054-iihgc5nr 125 33 Figure figure NN cord-260054-iihgc5nr 125 34 3 3 CD cord-260054-iihgc5nr 125 35 ) ) -RRB- cord-260054-iihgc5nr 125 36 . . . 
cord-260054-iihgc5nr 126 1 These these DT cord-260054-iihgc5nr 126 2 are be VBP cord-260054-iihgc5nr 126 3 all all RB cord-260054-iihgc5nr 126 4 strong strong JJ cord-260054-iihgc5nr 126 5 H h NN cord-260054-iihgc5nr 126 6 - - HYPH cord-260054-iihgc5nr 126 7 bonds bond NNS cord-260054-iihgc5nr 126 8 , , , cord-260054-iihgc5nr 126 9 especially especially RB cord-260054-iihgc5nr 126 10 the the DT cord-260054-iihgc5nr 126 11 one one CD cord-260054-iihgc5nr 126 12 between between IN cord-260054-iihgc5nr 126 13 S943 S943 NNP cord-260054-iihgc5nr 126 14 and and CC cord-260054-iihgc5nr 126 15 E1182 E1182 NNP cord-260054-iihgc5nr 126 16 , , , cord-260054-iihgc5nr 126 17 involving involve VBG cord-260054-iihgc5nr 126 18 a a DT cord-260054-iihgc5nr 126 19 negatively negatively RB cord-260054-iihgc5nr 126 20 charged charge VBN cord-260054-iihgc5nr 126 21 residue residue NN cord-260054-iihgc5nr 126 22 , , , cord-260054-iihgc5nr 126 23 and and CC cord-260054-iihgc5nr 126 24 the the DT cord-260054-iihgc5nr 126 25 one one CD cord-260054-iihgc5nr 126 26 between between IN cord-260054-iihgc5nr 126 27 D936 D936 NNS cord-260054-iihgc5nr 126 28 and and CC cord-260054-iihgc5nr 126 29 R1185 R1185 NNP cord-260054-iihgc5nr 126 30 , , , cord-260054-iihgc5nr 126 31 being be VBG cord-260054-iihgc5nr 126 32 actually actually RB cord-260054-iihgc5nr 126 33 a a DT cord-260054-iihgc5nr 126 34 salt salt NN cord-260054-iihgc5nr 126 35 bridge bridge NN cord-260054-iihgc5nr 126 36 ( ( -LRB- cord-260054-iihgc5nr 126 37 estimated estimate VBN cord-260054-iihgc5nr 126 38 to to TO cord-260054-iihgc5nr 126 39 contribute contribute VB cord-260054-iihgc5nr 126 40 an an DT cord-260054-iihgc5nr 126 41 additional additional JJ cord-260054-iihgc5nr 126 42 3 3 CD cord-260054-iihgc5nr 126 43 - - SYM cord-260054-iihgc5nr 126 44 5 5 CD cord-260054-iihgc5nr 126 45 kcal kcal NNP cord-260054-iihgc5nr 126 46 / / SYM cord-260054-iihgc5nr 126 47 mol mol NN cord-260054-iihgc5nr 126 48 to to IN cord-260054-iihgc5nr 126 49 
the the DT cord-260054-iihgc5nr 126 50 free free JJ cord-260054-iihgc5nr 126 51 energy energy NN cord-260054-iihgc5nr 126 52 of of IN cord-260054-iihgc5nr 126 53 protein protein NN cord-260054-iihgc5nr 126 54 stability stability NN cord-260054-iihgc5nr 126 55 as as IN cord-260054-iihgc5nr 126 56 compare compare VBP cord-260054-iihgc5nr 126 57 to to IN cord-260054-iihgc5nr 126 58 a a DT cord-260054-iihgc5nr 126 59 neutral neutral JJ cord-260054-iihgc5nr 126 60 H h NN cord-260054-iihgc5nr 126 61 - - HYPH cord-260054-iihgc5nr 126 62 bond bond NN cord-260054-iihgc5nr 126 63 ( ( -LRB- cord-260054-iihgc5nr 126 64 41 41 CD cord-260054-iihgc5nr 126 65 ) ) -RRB- cord-260054-iihgc5nr 126 66 ) ) -RRB- cord-260054-iihgc5nr 126 67 . . . cord-260054-iihgc5nr 127 1 All all PDT cord-260054-iihgc5nr 127 2 these these DT cord-260054-iihgc5nr 127 3 three three CD cord-260054-iihgc5nr 127 4 H h NN cord-260054-iihgc5nr 127 5 - - HYPH cord-260054-iihgc5nr 127 6 bonds bond NNS cord-260054-iihgc5nr 127 7 are be VBP cord-260054-iihgc5nr 127 8 lost lose VBN cord-260054-iihgc5nr 127 9 upon upon IN cord-260054-iihgc5nr 127 10 mutation mutation NN cord-260054-iihgc5nr 127 11 , , , cord-260054-iihgc5nr 127 12 which which WDT cord-260054-iihgc5nr 127 13 points point VBZ cord-260054-iihgc5nr 127 14 to to IN cord-260054-iihgc5nr 127 15 a a DT cord-260054-iihgc5nr 127 16 weakening weakening NN cord-260054-iihgc5nr 127 17 of of IN cord-260054-iihgc5nr 127 18 the the DT cord-260054-iihgc5nr 127 19 post post JJ cord-260054-iihgc5nr 127 20 - - JJ cord-260054-iihgc5nr 127 21 fusion fusion JJ cord-260054-iihgc5nr 127 22 assembly assembly NN cord-260054-iihgc5nr 127 23 . . . 
cord-260054-iihgc5nr 128 1 Of of IN cord-260054-iihgc5nr 128 2 the the DT cord-260054-iihgc5nr 128 3 remaining remain VBG cord-260054-iihgc5nr 128 4 three three CD cord-260054-iihgc5nr 128 5 mutations mutation NNS cord-260054-iihgc5nr 128 6 , , , cord-260054-iihgc5nr 128 7 S939F S939F NNP cord-260054-iihgc5nr 128 8 is be VBZ cord-260054-iihgc5nr 128 9 completely completely RB cord-260054-iihgc5nr 128 10 exposed expose VBN cord-260054-iihgc5nr 128 11 to to IN cord-260054-iihgc5nr 128 12 the the DT cord-260054-iihgc5nr 128 13 solvent solvent NN cord-260054-iihgc5nr 128 14 and and CC cord-260054-iihgc5nr 128 15 therefore therefore RB cord-260054-iihgc5nr 128 16 , , , cord-260054-iihgc5nr 128 17 like like UH cord-260054-iihgc5nr 128 18 in in IN cord-260054-iihgc5nr 128 19 the the DT cord-260054-iihgc5nr 128 20 pre pre JJ cord-260054-iihgc5nr 128 21 - - JJ cord-260054-iihgc5nr 128 22 fusion fusion JJ cord-260054-iihgc5nr 128 23 conformation conformation NN cord-260054-iihgc5nr 128 24 , , , cord-260054-iihgc5nr 128 25 expected expect VBN cord-260054-iihgc5nr 128 26 to to TO cord-260054-iihgc5nr 128 27 act act VB cord-260054-iihgc5nr 128 28 unfavourable unfavourable JJ cord-260054-iihgc5nr 128 29 on on IN cord-260054-iihgc5nr 128 30 the the DT cord-260054-iihgc5nr 128 31 protein protein NN cord-260054-iihgc5nr 128 32 solvation solvation NN cord-260054-iihgc5nr 128 33 energy energy NN cord-260054-iihgc5nr 128 34 . . . 
cord-260054-iihgc5nr 129 1 On on IN cord-260054-iihgc5nr 129 2 the the DT cord-260054-iihgc5nr 129 3 contrary contrary NN cord-260054-iihgc5nr 129 4 , , , cord-260054-iihgc5nr 129 5 in in IN cord-260054-iihgc5nr 129 6 case case NN cord-260054-iihgc5nr 129 7 of of IN cord-260054-iihgc5nr 129 8 L938F L938F NNP cord-260054-iihgc5nr 129 9 and and CC cord-260054-iihgc5nr 129 10 S940F S940F NNP cord-260054-iihgc5nr 129 11 , , , cord-260054-iihgc5nr 129 12 which which WDT cord-260054-iihgc5nr 129 13 are be VBP cord-260054-iihgc5nr 129 14 substantially substantially RB cord-260054-iihgc5nr 129 15 buried bury VBN cord-260054-iihgc5nr 129 16 within within IN cord-260054-iihgc5nr 129 17 the the DT cord-260054-iihgc5nr 129 18 structure structure NN cord-260054-iihgc5nr 129 19 , , , cord-260054-iihgc5nr 129 20 mutation mutation NN cord-260054-iihgc5nr 129 21 to to IN cord-260054-iihgc5nr 129 22 a a DT cord-260054-iihgc5nr 129 23 large large JJ cord-260054-iihgc5nr 129 24 aromatic aromatic JJ cord-260054-iihgc5nr 129 25 phenylalanine phenylalanine NN cord-260054-iihgc5nr 129 26 seems seem VBZ cord-260054-iihgc5nr 129 27 even even RB cord-260054-iihgc5nr 129 28 to to TO cord-260054-iihgc5nr 129 29 optimize optimize VB cord-260054-iihgc5nr 129 30 the the DT cord-260054-iihgc5nr 129 31 network network NN cord-260054-iihgc5nr 129 32 of of IN cord-260054-iihgc5nr 129 33 the the DT cord-260054-iihgc5nr 129 34 hydrophobic hydrophobic JJ cord-260054-iihgc5nr 129 35 interactions interaction NNS cord-260054-iihgc5nr 129 36 ; ; : cord-260054-iihgc5nr 129 37 in in IN cord-260054-iihgc5nr 129 38 case case NN cord-260054-iihgc5nr 129 39 of of IN cord-260054-iihgc5nr 129 40 F940 f940 CD cord-260054-iihgc5nr 129 41 , , , cord-260054-iihgc5nr 129 42 with with IN cord-260054-iihgc5nr 129 43 the the DT cord-260054-iihgc5nr 129 44 aliphatic aliphatic JJ cord-260054-iihgc5nr 129 45 parts part NNS cord-260054-iihgc5nr 129 46 of of IN cord-260054-iihgc5nr 129 47 the the DT cord-260054-iihgc5nr 129 48 
side side NN cord-260054-iihgc5nr 129 49 - - HYPH cord-260054-iihgc5nr 129 50 chains chain NNS cord-260054-iihgc5nr 129 51 of of IN cord-260054-iihgc5nr 129 52 E1182 E1182 NNP cord-260054-iihgc5nr 129 53 and and CC cord-260054-iihgc5nr 129 54 R1185 R1185 NNP cord-260054-iihgc5nr 129 55 on on IN cord-260054-iihgc5nr 129 56 an an DT cord-260054-iihgc5nr 129 57 adjacent adjacent JJ cord-260054-iihgc5nr 129 58 monomer monomer NN cord-260054-iihgc5nr 129 59 , , , cord-260054-iihgc5nr 129 60 and and CC cord-260054-iihgc5nr 129 61 , , , cord-260054-iihgc5nr 129 62 in in IN cord-260054-iihgc5nr 129 63 case case NN cord-260054-iihgc5nr 129 64 of of IN cord-260054-iihgc5nr 129 65 F938 F938 NNP cord-260054-iihgc5nr 129 66 , , , cord-260054-iihgc5nr 129 67 with with IN cord-260054-iihgc5nr 129 68 V1189 V1189 NNP cord-260054-iihgc5nr 129 69 and and CC cord-260054-iihgc5nr 129 70 A1190 A1190 NNP cord-260054-iihgc5nr 129 71 on on IN cord-260054-iihgc5nr 129 72 the the DT cord-260054-iihgc5nr 129 73 same same JJ cord-260054-iihgc5nr 129 74 monomer monomer NN cord-260054-iihgc5nr 129 75 and and CC cord-260054-iihgc5nr 129 76 with with IN cord-260054-iihgc5nr 129 77 other other JJ cord-260054-iihgc5nr 129 78 F938 f938 CD cord-260054-iihgc5nr 129 79 residues residue NNS cord-260054-iihgc5nr 129 80 on on IN cord-260054-iihgc5nr 129 81 both both CC cord-260054-iihgc5nr 129 82 the the DT cord-260054-iihgc5nr 129 83 adjacent adjacent JJ cord-260054-iihgc5nr 129 84 monomers monomer NNS cord-260054-iihgc5nr 129 85 . . . 
cord-260054-iihgc5nr 130 1 When when WRB cord-260054-iihgc5nr 130 2 comparing compare VBG cord-260054-iihgc5nr 130 3 the the DT cord-260054-iihgc5nr 130 4 effect effect NN cord-260054-iihgc5nr 130 5 of of IN cord-260054-iihgc5nr 130 6 the the DT cord-260054-iihgc5nr 130 7 mutations mutation NNS cord-260054-iihgc5nr 130 8 on on IN cord-260054-iihgc5nr 130 9 the the DT cord-260054-iihgc5nr 130 10 pre pre NN cord-260054-iihgc5nr 130 11 - - JJ cord-260054-iihgc5nr 130 12 and and CC cord-260054-iihgc5nr 130 13 post post JJ cord-260054-iihgc5nr 130 14 - - JJ cord-260054-iihgc5nr 130 15 fusion fusion JJ cord-260054-iihgc5nr 130 16 structures structure NNS cord-260054-iihgc5nr 130 17 , , , cord-260054-iihgc5nr 130 18 it -PRON- PRP cord-260054-iihgc5nr 130 19 emerges emerge VBZ cord-260054-iihgc5nr 130 20 that that IN cord-260054-iihgc5nr 130 21 the the DT cord-260054-iihgc5nr 130 22 S929I S929I NNP cord-260054-iihgc5nr 130 23 , , , cord-260054-iihgc5nr 130 24 D936Y D936Y NNP cord-260054-iihgc5nr 130 25 and and CC cord-260054-iihgc5nr 130 26 S943I s943i NN cord-260054-iihgc5nr 130 27 mutations mutation NNS cord-260054-iihgc5nr 130 28 strongly strongly RB cord-260054-iihgc5nr 130 29 destabilize destabilize VBP cord-260054-iihgc5nr 130 30 the the DT cord-260054-iihgc5nr 130 31 postfusion postfusion NN cord-260054-iihgc5nr 130 32 conformation conformation NN cord-260054-iihgc5nr 130 33 , , , cord-260054-iihgc5nr 130 34 while while IN cord-260054-iihgc5nr 130 35 having have VBG cord-260054-iihgc5nr 130 36 a a DT cord-260054-iihgc5nr 130 37 marginal marginal JJ cord-260054-iihgc5nr 130 38 impact impact NN cord-260054-iihgc5nr 130 39 on on IN cord-260054-iihgc5nr 130 40 the the DT cord-260054-iihgc5nr 130 41 stability stability NN cord-260054-iihgc5nr 130 42 of of IN cord-260054-iihgc5nr 130 43 the the DT cord-260054-iihgc5nr 130 44 pre pre JJ cord-260054-iihgc5nr 130 45 - - JJ cord-260054-iihgc5nr 130 46 fusion fusion JJ cord-260054-iihgc5nr 130 47 one one CD 
cord-260054-iihgc5nr 130 48 . . . cord-260054-iihgc5nr 131 1 On on IN cord-260054-iihgc5nr 131 2 the the DT cord-260054-iihgc5nr 131 3 contrary contrary NN cord-260054-iihgc5nr 131 4 , , , cord-260054-iihgc5nr 131 5 S940F S940F NNP cord-260054-iihgc5nr 131 6 seems seem VBZ cord-260054-iihgc5nr 131 7 to to TO cord-260054-iihgc5nr 131 8 favour favour VB cord-260054-iihgc5nr 131 9 the the DT cord-260054-iihgc5nr 131 10 post post JJ cord-260054-iihgc5nr 131 11 - - JJ cord-260054-iihgc5nr 131 12 fusion fusion JJ cord-260054-iihgc5nr 131 13 conformation conformation NN cord-260054-iihgc5nr 131 14 over over IN cord-260054-iihgc5nr 131 15 the the DT cord-260054-iihgc5nr 131 16 pre pre JJ cord-260054-iihgc5nr 131 17 - - JJ cord-260054-iihgc5nr 131 18 fusion fusion JJ cord-260054-iihgc5nr 131 19 one one CD cord-260054-iihgc5nr 131 20 . . . cord-260054-iihgc5nr 132 1 As as IN cord-260054-iihgc5nr 132 2 for for IN cord-260054-iihgc5nr 132 3 S938F S938F NNP cord-260054-iihgc5nr 132 4 and and CC cord-260054-iihgc5nr 132 5 S939F S939F NNP cord-260054-iihgc5nr 132 6 , , , cord-260054-iihgc5nr 132 7 they -PRON- PRP cord-260054-iihgc5nr 132 8 seem seem VBP cord-260054-iihgc5nr 132 9 to to TO cord-260054-iihgc5nr 132 10 have have VB cord-260054-iihgc5nr 132 11 a a DT cord-260054-iihgc5nr 132 12 comparable comparable JJ cord-260054-iihgc5nr 132 13 effect effect NN cord-260054-iihgc5nr 132 14 on on IN cord-260054-iihgc5nr 132 15 both both CC cord-260054-iihgc5nr 132 16 the the DT cord-260054-iihgc5nr 132 17 conformations conformation NNS cord-260054-iihgc5nr 132 18 , , , cord-260054-iihgc5nr 132 19 slightly slightly RB cord-260054-iihgc5nr 132 20 stabilizing stabilize VBG cord-260054-iihgc5nr 132 21 and and CC cord-260054-iihgc5nr 132 22 destabilizing destabilizing JJ cord-260054-iihgc5nr 132 23 , , , cord-260054-iihgc5nr 132 24 respectively respectively RB cord-260054-iihgc5nr 132 25 . . . 
cord-260054-iihgc5nr 133 1 Finally finally RB cord-260054-iihgc5nr 133 2 , , , cord-260054-iihgc5nr 133 3 the the DT cord-260054-iihgc5nr 133 4 S943P s943p NN cord-260054-iihgc5nr 133 5 mutation mutation NN cord-260054-iihgc5nr 133 6 would would MD cord-260054-iihgc5nr 133 7 strongly strongly RB cord-260054-iihgc5nr 133 8 destabilize destabilize VB cord-260054-iihgc5nr 133 9 both both CC cord-260054-iihgc5nr 133 10 the the DT cord-260054-iihgc5nr 133 11 pre pre NN cord-260054-iihgc5nr 133 12 - - JJ cord-260054-iihgc5nr 133 13 and and CC cord-260054-iihgc5nr 133 14 post post JJ cord-260054-iihgc5nr 133 15 - - JJ cord-260054-iihgc5nr 133 16 fusion fusion JJ cord-260054-iihgc5nr 133 17 conformations conformation NNS cord-260054-iihgc5nr 133 18 . . . cord-260054-iihgc5nr 134 1 Based base VBN cord-260054-iihgc5nr 134 2 on on IN cord-260054-iihgc5nr 134 3 a a DT cord-260054-iihgc5nr 134 4 thorough thorough JJ cord-260054-iihgc5nr 134 5 analysis analysis NN cord-260054-iihgc5nr 134 6 of of IN cord-260054-iihgc5nr 134 7 the the DT cord-260054-iihgc5nr 134 8 S S NNP cord-260054-iihgc5nr 134 9 protein protein NN cord-260054-iihgc5nr 134 10 sequences sequence NNS cord-260054-iihgc5nr 134 11 , , , cord-260054-iihgc5nr 134 12 that that IN cord-260054-iihgc5nr 134 13 we -PRON- PRP cord-260054-iihgc5nr 134 14 extracted extract VBD cord-260054-iihgc5nr 134 15 from from IN cord-260054-iihgc5nr 134 16 the the DT cord-260054-iihgc5nr 134 17 genomic genomic JJ cord-260054-iihgc5nr 134 18 sequences sequence NNS cord-260054-iihgc5nr 134 19 of of IN cord-260054-iihgc5nr 134 20 SARS SARS NNP cord-260054-iihgc5nr 134 21 - - HYPH cord-260054-iihgc5nr 134 22 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 134 23 reported report VBD cord-260054-iihgc5nr 134 24 in in IN cord-260054-iihgc5nr 134 25 GISAID GISAID NNP cord-260054-iihgc5nr 134 26 on on IN cord-260054-iihgc5nr 134 27 April April NNP cord-260054-iihgc5nr 134 28 21 21 CD cord-260054-iihgc5nr 134 29 st st NNP cord-260054-iihgc5nr 134 30 , , , 
cord-260054-iihgc5nr 134 31 we -PRON- PRP cord-260054-iihgc5nr 134 32 identified identify VBD cord-260054-iihgc5nr 134 33 the the DT cord-260054-iihgc5nr 134 34 fusion fusion NN cord-260054-iihgc5nr 134 35 core core NN cord-260054-iihgc5nr 134 36 of of IN cord-260054-iihgc5nr 134 37 the the DT cord-260054-iihgc5nr 134 38 HR1 HR1 NNP cord-260054-iihgc5nr 134 39 as as IN cord-260054-iihgc5nr 134 40 a a DT cord-260054-iihgc5nr 134 41 mutational mutational JJ cord-260054-iihgc5nr 134 42 hotspot hotspot NN cord-260054-iihgc5nr 134 43 . . . cord-260054-iihgc5nr 135 1 The the DT cord-260054-iihgc5nr 135 2 D936Y D936Y NNP cord-260054-iihgc5nr 135 3 and and CC cord-260054-iihgc5nr 135 4 S943P s943p NN cord-260054-iihgc5nr 135 5 mutations mutation NNS cord-260054-iihgc5nr 135 6 were be VBD cord-260054-iihgc5nr 135 7 the the DT cord-260054-iihgc5nr 135 8 most most RBS cord-260054-iihgc5nr 135 9 numerous numerous JJ cord-260054-iihgc5nr 135 10 , , , cord-260054-iihgc5nr 135 11 being be VBG cord-260054-iihgc5nr 135 12 among among IN cord-260054-iihgc5nr 135 13 the the DT cord-260054-iihgc5nr 135 14 most most RBS cord-260054-iihgc5nr 135 15 frequently frequently RB cord-260054-iihgc5nr 135 16 occurring occur VBG cord-260054-iihgc5nr 135 17 mutations mutation NNS cord-260054-iihgc5nr 135 18 overall overall RB cord-260054-iihgc5nr 135 19 at at IN cord-260054-iihgc5nr 135 20 the the DT cord-260054-iihgc5nr 135 21 time time NN cord-260054-iihgc5nr 135 22 . . . 
cord-260054-iihgc5nr 136 1 Other other JJ cord-260054-iihgc5nr 136 2 , , , cord-260054-iihgc5nr 136 3 less less RBR cord-260054-iihgc5nr 136 4 frequent frequent JJ cord-260054-iihgc5nr 136 5 , , , cord-260054-iihgc5nr 136 6 mutations mutation NNS cord-260054-iihgc5nr 136 7 were be VBD cord-260054-iihgc5nr 136 8 S939F S939F NNP cord-260054-iihgc5nr 136 9 and and CC cord-260054-iihgc5nr 136 10 then then RB cord-260054-iihgc5nr 136 11 S929I S929I NNP cord-260054-iihgc5nr 136 12 , , , cord-260054-iihgc5nr 136 13 L938F L938F NNP cord-260054-iihgc5nr 136 14 and and CC cord-260054-iihgc5nr 136 15 S940F. s940f. NN cord-260054-iihgc5nr 137 1 Overall overall RB cord-260054-iihgc5nr 137 2 , , , cord-260054-iihgc5nr 137 3 such such JJ cord-260054-iihgc5nr 137 4 mutations mutation NNS cord-260054-iihgc5nr 137 5 appeared appear VBD cord-260054-iihgc5nr 137 6 to to TO cord-260054-iihgc5nr 137 7 be be VB cord-260054-iihgc5nr 137 8 late late JJ cord-260054-iihgc5nr 137 9 ones one NNS cord-260054-iihgc5nr 137 10 , , , cord-260054-iihgc5nr 137 11 emerging emerge VBG cord-260054-iihgc5nr 137 12 starting start VBG cord-260054-iihgc5nr 137 13 from from IN cord-260054-iihgc5nr 137 14 the the DT cord-260054-iihgc5nr 137 15 end end NN cord-260054-iihgc5nr 137 16 of of IN cord-260054-iihgc5nr 137 17 February February NNP cord-260054-iihgc5nr 137 18 or or CC cord-260054-iihgc5nr 137 19 even even RB cord-260054-iihgc5nr 137 20 mid mid JJ cord-260054-iihgc5nr 137 21 March March NNP cord-260054-iihgc5nr 137 22 2020 2020 CD cord-260054-iihgc5nr 137 23 , , , cord-260054-iihgc5nr 137 24 and and CC cord-260054-iihgc5nr 137 25 were be VBD cord-260054-iihgc5nr 137 26 mainly mainly RB cord-260054-iihgc5nr 137 27 localized localize VBN cord-260054-iihgc5nr 137 28 in in IN cord-260054-iihgc5nr 137 29 Europe Europe NNP cord-260054-iihgc5nr 137 30 and and CC cord-260054-iihgc5nr 137 31 USA USA NNP cord-260054-iihgc5nr 137 32 . . . 
cord-260054-iihgc5nr 138 1 Based base VBN cord-260054-iihgc5nr 138 2 on on IN cord-260054-iihgc5nr 138 3 their -PRON- PRP$ cord-260054-iihgc5nr 138 4 frequency frequency NN cord-260054-iihgc5nr 138 5 , , , cord-260054-iihgc5nr 138 6 on on IN cord-260054-iihgc5nr 138 7 their -PRON- PRP$ cord-260054-iihgc5nr 138 8 location location NN cord-260054-iihgc5nr 138 9 in in IN cord-260054-iihgc5nr 138 10 a a DT cord-260054-iihgc5nr 138 11 protein protein NN cord-260054-iihgc5nr 138 12 region region NN cord-260054-iihgc5nr 138 13 playing play VBG cord-260054-iihgc5nr 138 14 a a DT cord-260054-iihgc5nr 138 15 key key JJ cord-260054-iihgc5nr 138 16 role role NN cord-260054-iihgc5nr 138 17 in in IN cord-260054-iihgc5nr 138 18 the the DT cord-260054-iihgc5nr 138 19 post post JJ cord-260054-iihgc5nr 138 20 - - JJ cord-260054-iihgc5nr 138 21 fusion fusion JJ cord-260054-iihgc5nr 138 22 conformation conformation NN cord-260054-iihgc5nr 138 23 and and CC cord-260054-iihgc5nr 138 24 also also RB cord-260054-iihgc5nr 138 25 on on IN cord-260054-iihgc5nr 138 26 the the DT cord-260054-iihgc5nr 138 27 non non JJ cord-260054-iihgc5nr 138 28 - - JJ cord-260054-iihgc5nr 138 29 conservative conservative JJ cord-260054-iihgc5nr 138 30 nature nature NN cord-260054-iihgc5nr 138 31 of of IN cord-260054-iihgc5nr 138 32 the the DT cord-260054-iihgc5nr 138 33 mutations mutation NNS cord-260054-iihgc5nr 138 34 themselves -PRON- PRP cord-260054-iihgc5nr 138 35 , , , cord-260054-iihgc5nr 138 36 we -PRON- PRP cord-260054-iihgc5nr 138 37 decided decide VBD cord-260054-iihgc5nr 138 38 to to TO cord-260054-iihgc5nr 138 39 further further RB cord-260054-iihgc5nr 138 40 investigate investigate VB cord-260054-iihgc5nr 138 41 the the DT cord-260054-iihgc5nr 138 42 structural structural JJ cord-260054-iihgc5nr 138 43 basis basis NN cord-260054-iihgc5nr 138 44 of of IN cord-260054-iihgc5nr 138 45 such such JJ cord-260054-iihgc5nr 138 46 mutations mutation NNS cord-260054-iihgc5nr 138 47 , , , 
cord-260054-iihgc5nr 138 48 finding find VBG cord-260054-iihgc5nr 138 49 out out RP cord-260054-iihgc5nr 138 50 that that IN cord-260054-iihgc5nr 138 51 they -PRON- PRP cord-260054-iihgc5nr 138 52 all all DT cord-260054-iihgc5nr 138 53 can can MD cord-260054-iihgc5nr 138 54 play play VB cord-260054-iihgc5nr 138 55 a a DT cord-260054-iihgc5nr 138 56 role role NN cord-260054-iihgc5nr 138 57 in in IN cord-260054-iihgc5nr 138 58 tuning tune VBG cord-260054-iihgc5nr 138 59 the the DT cord-260054-iihgc5nr 138 60 stability stability NN cord-260054-iihgc5nr 138 61 of of IN cord-260054-iihgc5nr 138 62 the the DT cord-260054-iihgc5nr 138 63 preand preand NN cord-260054-iihgc5nr 138 64 / / SYM cord-260054-iihgc5nr 138 65 or or CC cord-260054-iihgc5nr 138 66 post post JJ cord-260054-iihgc5nr 138 67 - - JJ cord-260054-iihgc5nr 138 68 fusion fusion JJ cord-260054-iihgc5nr 138 69 S S NNP cord-260054-iihgc5nr 138 70 protein protein NN cord-260054-iihgc5nr 138 71 conformation conformation NN cord-260054-iihgc5nr 138 72 . . . cord-260054-iihgc5nr 139 1 Other other JJ cord-260054-iihgc5nr 139 2 potentially potentially RB cord-260054-iihgc5nr 139 3 interesting interesting JJ cord-260054-iihgc5nr 139 4 mutations mutation NNS cord-260054-iihgc5nr 139 5 are be VBP cord-260054-iihgc5nr 139 6 S929I S929I NNP cord-260054-iihgc5nr 139 7 and and CC cord-260054-iihgc5nr 139 8 S939F S939F NNP cord-260054-iihgc5nr 139 9 , , , cord-260054-iihgc5nr 139 10 whose whose WP$ cord-260054-iihgc5nr 139 11 number number NN cord-260054-iihgc5nr 139 12 of of IN cord-260054-iihgc5nr 139 13 occurrences occurrence NNS cord-260054-iihgc5nr 139 14 underwent undergo VBD cord-260054-iihgc5nr 139 15 a a DT cord-260054-iihgc5nr 139 16 ≈2/3-fold ≈2/3-fold NNP cord-260054-iihgc5nr 139 17 increase increase NN cord-260054-iihgc5nr 139 18 . . . 
cord-260054-iihgc5nr 140 1 On on IN cord-260054-iihgc5nr 140 2 the the DT cord-260054-iihgc5nr 140 3 other other JJ cord-260054-iihgc5nr 140 4 hand hand NN cord-260054-iihgc5nr 140 5 , , , cord-260054-iihgc5nr 140 6 the the DT cord-260054-iihgc5nr 140 7 increment increment NN cord-260054-iihgc5nr 140 8 in in IN cord-260054-iihgc5nr 140 9 the the DT cord-260054-iihgc5nr 140 10 occurrence occurrence NN cord-260054-iihgc5nr 140 11 of of IN cord-260054-iihgc5nr 140 12 L938F L938F NNP cord-260054-iihgc5nr 140 13 and and CC cord-260054-iihgc5nr 140 14 S940F S940F NNP cord-260054-iihgc5nr 140 15 was be VBD cord-260054-iihgc5nr 140 16 marginal marginal JJ cord-260054-iihgc5nr 140 17 , , , cord-260054-iihgc5nr 140 18 posing pose VBG cord-260054-iihgc5nr 140 19 less less JJR cord-260054-iihgc5nr 140 20 emphasis emphasis NN cord-260054-iihgc5nr 140 21 on on IN cord-260054-iihgc5nr 140 22 such such JJ cord-260054-iihgc5nr 140 23 mutations mutation NNS cord-260054-iihgc5nr 140 24 , , , cord-260054-iihgc5nr 140 25 which which WDT cord-260054-iihgc5nr 140 26 will will MD cord-260054-iihgc5nr 140 27 be be VB cord-260054-iihgc5nr 140 28 nonetheless nonetheless RB cord-260054-iihgc5nr 140 29 useful useful JJ cord-260054-iihgc5nr 140 30 to to TO cord-260054-iihgc5nr 140 31 continue continue VB cord-260054-iihgc5nr 140 32 monitoring monitor VBG cord-260054-iihgc5nr 140 33 . . . 
cord-260054-iihgc5nr 141 1 Finally finally RB cord-260054-iihgc5nr 141 2 , , , cord-260054-iihgc5nr 141 3 the the DT cord-260054-iihgc5nr 141 4 S943P S943P NNP cord-260054-iihgc5nr 141 5 mutation mutation NN cord-260054-iihgc5nr 141 6 , , , cord-260054-iihgc5nr 141 7 although although IN cord-260054-iihgc5nr 141 8 still still RB cord-260054-iihgc5nr 141 9 reported report VBN cord-260054-iihgc5nr 141 10 in in IN cord-260054-iihgc5nr 141 11 few few JJ cord-260054-iihgc5nr 141 12 cases case NNS cord-260054-iihgc5nr 141 13 , , , cord-260054-iihgc5nr 141 14 underwent undergo VBD cord-260054-iihgc5nr 141 15 a a DT cord-260054-iihgc5nr 141 16 dramatic dramatic JJ cord-260054-iihgc5nr 141 17 reduction reduction NN cord-260054-iihgc5nr 141 18 of of IN cord-260054-iihgc5nr 141 19 occurrences occurrence NNS cord-260054-iihgc5nr 141 20 , , , cord-260054-iihgc5nr 141 21 due due IN cord-260054-iihgc5nr 141 22 to to IN cord-260054-iihgc5nr 141 23 modification modification NN cord-260054-iihgc5nr 141 24 of of IN cord-260054-iihgc5nr 141 25 the the DT cord-260054-iihgc5nr 141 26 original original JJ cord-260054-iihgc5nr 141 27 sequences sequence NNS cord-260054-iihgc5nr 141 28 where where WRB cord-260054-iihgc5nr 141 29 they -PRON- PRP cord-260054-iihgc5nr 141 30 were be VBD cord-260054-iihgc5nr 141 31 first first RB cord-260054-iihgc5nr 141 32 reported report VBN cord-260054-iihgc5nr 141 33 . . . 
cord-260054-iihgc5nr 142 1 At at IN cord-260054-iihgc5nr 142 2 the the DT cord-260054-iihgc5nr 142 3 same same JJ cord-260054-iihgc5nr 142 4 time time NN cord-260054-iihgc5nr 142 5 , , , cord-260054-iihgc5nr 142 6 a a DT cord-260054-iihgc5nr 142 7 S943I S943I NNP cord-260054-iihgc5nr 142 8 mutation mutation NN cord-260054-iihgc5nr 142 9 emerged emerge VBD cord-260054-iihgc5nr 142 10 , , , cord-260054-iihgc5nr 142 11 that that DT cord-260054-iihgc5nr 142 12 will will MD cord-260054-iihgc5nr 142 13 also also RB cord-260054-iihgc5nr 142 14 be be VB cord-260054-iihgc5nr 142 15 worth worth JJ cord-260054-iihgc5nr 142 16 continuing continue VBG cord-260054-iihgc5nr 142 17 to to TO cord-260054-iihgc5nr 142 18 monitor monitor VB cord-260054-iihgc5nr 142 19 . . . cord-260054-iihgc5nr 143 1 We -PRON- PRP cord-260054-iihgc5nr 143 2 remind remind VBP cord-260054-iihgc5nr 143 3 here here RB cord-260054-iihgc5nr 143 4 that that IN cord-260054-iihgc5nr 143 5 a a DT cord-260054-iihgc5nr 143 6 proline proline NN cord-260054-iihgc5nr 143 7 at at IN cord-260054-iihgc5nr 143 8 position position NN cord-260054-iihgc5nr 143 9 943 943 CD cord-260054-iihgc5nr 143 10 would would MD cord-260054-iihgc5nr 143 11 cause cause VB cord-260054-iihgc5nr 143 12 a a DT cord-260054-iihgc5nr 143 13 significant significant JJ cord-260054-iihgc5nr 143 14 destabilization destabilization NN cord-260054-iihgc5nr 143 15 on on IN cord-260054-iihgc5nr 143 16 the the DT cord-260054-iihgc5nr 143 17 S S NNP cord-260054-iihgc5nr 143 18 protein protein NN cord-260054-iihgc5nr 143 19 pre pre JJ cord-260054-iihgc5nr 143 20 - - JJ cord-260054-iihgc5nr 143 21 fusion fusion JJ cord-260054-iihgc5nr 143 22 conformation conformation NN cord-260054-iihgc5nr 143 23 . . . 
cord-260054-iihgc5nr 144 1 It -PRON- PRP cord-260054-iihgc5nr 144 2 is be VBZ cord-260054-iihgc5nr 144 3 also also RB cord-260054-iihgc5nr 144 4 worth worth JJ cord-260054-iihgc5nr 144 5 noticing notice VBG cord-260054-iihgc5nr 144 6 that that IN cord-260054-iihgc5nr 144 7 the the DT cord-260054-iihgc5nr 144 8 2 2 CD cord-260054-iihgc5nr 144 9 mutations mutation NNS cord-260054-iihgc5nr 144 10 significantly significantly RB cord-260054-iihgc5nr 144 11 increasing increase VBG cord-260054-iihgc5nr 144 12 their -PRON- PRP$ cord-260054-iihgc5nr 144 13 frequency frequency NN cord-260054-iihgc5nr 144 14 over over IN cord-260054-iihgc5nr 144 15 time time NN cord-260054-iihgc5nr 144 16 , , , cord-260054-iihgc5nr 144 17 D936Y D936Y NNP cord-260054-iihgc5nr 144 18 and and CC cord-260054-iihgc5nr 144 19 S929I S929I NNP cord-260054-iihgc5nr 144 20 , , , cord-260054-iihgc5nr 144 21 were be VBD cord-260054-iihgc5nr 144 22 also also RB cord-260054-iihgc5nr 144 23 those those DT cord-260054-iihgc5nr 144 24 that that WDT cord-260054-iihgc5nr 144 25 , , , cord-260054-iihgc5nr 144 26 together together RB cord-260054-iihgc5nr 144 27 with with IN cord-260054-iihgc5nr 144 28 S943P s943p NN cord-260054-iihgc5nr 144 29 / / SYM cord-260054-iihgc5nr 144 30 I -PRON- PRP cord-260054-iihgc5nr 144 31 , , , cord-260054-iihgc5nr 144 32 caused cause VBD cord-260054-iihgc5nr 144 33 the the DT cord-260054-iihgc5nr 144 34 loss loss NN cord-260054-iihgc5nr 144 35 of of IN cord-260054-iihgc5nr 144 36 a a DT cord-260054-iihgc5nr 144 37 inter inter JJ cord-260054-iihgc5nr 144 38 - - JJ cord-260054-iihgc5nr 144 39 monomer monomer JJ cord-260054-iihgc5nr 144 40 H h NN cord-260054-iihgc5nr 144 41 - - HYPH cord-260054-iihgc5nr 144 42 bond bond NN cord-260054-iihgc5nr 144 43 in in IN cord-260054-iihgc5nr 144 44 the the DT cord-260054-iihgc5nr 144 45 post post JJ cord-260054-iihgc5nr 144 46 - - JJ cord-260054-iihgc5nr 144 47 fusion fusion JJ cord-260054-iihgc5nr 144 48 conformation conformation NN 
cord-260054-iihgc5nr 144 49 of of IN cord-260054-iihgc5nr 144 50 the the DT cord-260054-iihgc5nr 144 51 protein protein NN cord-260054-iihgc5nr 144 52 . . . cord-260054-iihgc5nr 145 1 Interestingly interestingly RB cord-260054-iihgc5nr 145 2 , , , cord-260054-iihgc5nr 145 3 the the DT cord-260054-iihgc5nr 145 4 now now RB cord-260054-iihgc5nr 145 5 emerging emerge VBG cord-260054-iihgc5nr 145 6 S943I S943I NNP cord-260054-iihgc5nr 145 7 mutation mutation NN cord-260054-iihgc5nr 145 8 gets get VBZ cord-260054-iihgc5nr 145 9 the the DT cord-260054-iihgc5nr 145 10 same same JJ cord-260054-iihgc5nr 145 11 effect effect NN cord-260054-iihgc5nr 145 12 without without IN cord-260054-iihgc5nr 145 13 destabilizing destabilize VBG cord-260054-iihgc5nr 145 14 the the DT cord-260054-iihgc5nr 145 15 pre pre JJ cord-260054-iihgc5nr 145 16 - - JJ cord-260054-iihgc5nr 145 17 fusion fusion JJ cord-260054-iihgc5nr 145 18 conformation conformation NN cord-260054-iihgc5nr 145 19 . . . cord-260054-iihgc5nr 146 1 The the DT cord-260054-iihgc5nr 146 2 most most RBS cord-260054-iihgc5nr 146 3 frequently frequently RB cord-260054-iihgc5nr 146 4 occurring occur VBG cord-260054-iihgc5nr 146 5 mutation mutation NN cord-260054-iihgc5nr 146 6 in in IN cord-260054-iihgc5nr 146 7 the the DT cord-260054-iihgc5nr 146 8 HR1 HR1 NNP cord-260054-iihgc5nr 146 9 " " `` cord-260054-iihgc5nr 146 10 fusion fusion NN cord-260054-iihgc5nr 146 11 core core NN cord-260054-iihgc5nr 146 12 " " '' cord-260054-iihgc5nr 146 13 , , , cord-260054-iihgc5nr 146 14 common common JJ cord-260054-iihgc5nr 146 15 in in IN cord-260054-iihgc5nr 146 16 Sweden Sweden NNP cord-260054-iihgc5nr 146 17 and and CC cord-260054-iihgc5nr 146 18 UK UK NNP cord-260054-iihgc5nr 146 19 on on IN cord-260054-iihgc5nr 146 20 May May NNP cord-260054-iihgc5nr 146 21 29 29 CD cord-260054-iihgc5nr 146 22 th th XX cord-260054-iihgc5nr 146 23 , , , cord-260054-iihgc5nr 146 24 is be VBZ cord-260054-iihgc5nr 146 25 also also RB cord-260054-iihgc5nr 
146 26 the the DT cord-260054-iihgc5nr 146 27 one one NN cord-260054-iihgc5nr 146 28 causing cause VBG cord-260054-iihgc5nr 146 29 the the DT cord-260054-iihgc5nr 146 30 loss loss NN cord-260054-iihgc5nr 146 31 of of IN cord-260054-iihgc5nr 146 32 a a DT cord-260054-iihgc5nr 146 33 strong strong JJ cord-260054-iihgc5nr 146 34 inter inter JJ cord-260054-iihgc5nr 146 35 - - JJ cord-260054-iihgc5nr 146 36 monomer monomer JJ cord-260054-iihgc5nr 146 37 salt salt NN cord-260054-iihgc5nr 146 38 bridge bridge NN cord-260054-iihgc5nr 146 39 . . . cord-260054-iihgc5nr 147 1 Our -PRON- PRP$ cord-260054-iihgc5nr 147 2 structural structural JJ cord-260054-iihgc5nr 147 3 analyses analysis NNS cord-260054-iihgc5nr 147 4 provide provide VBP cord-260054-iihgc5nr 147 5 a a DT cord-260054-iihgc5nr 147 6 rationale rationale NN cord-260054-iihgc5nr 147 7 for for IN cord-260054-iihgc5nr 147 8 such such JJ cord-260054-iihgc5nr 147 9 mutations mutation NNS cord-260054-iihgc5nr 147 10 , , , cord-260054-iihgc5nr 147 11 pointing point VBG cord-260054-iihgc5nr 147 12 to to IN cord-260054-iihgc5nr 147 13 a a DT cord-260054-iihgc5nr 147 14 weakening weakening NN cord-260054-iihgc5nr 147 15 of of IN cord-260054-iihgc5nr 147 16 the the DT cord-260054-iihgc5nr 147 17 post post JJ cord-260054-iihgc5nr 147 18 - - JJ cord-260054-iihgc5nr 147 19 fusion fusion JJ cord-260054-iihgc5nr 147 20 assembly assembly NN cord-260054-iihgc5nr 147 21 . . . 
cord-260054-iihgc5nr 148 1 However however RB cord-260054-iihgc5nr 148 2 , , , cord-260054-iihgc5nr 148 3 only only RB cord-260054-iihgc5nr 148 4 experiments experiment NNS cord-260054-iihgc5nr 148 5 on on IN cord-260054-iihgc5nr 148 6 cellular cellular JJ cord-260054-iihgc5nr 148 7 systems system NNS cord-260054-iihgc5nr 148 8 will will MD cord-260054-iihgc5nr 148 9 clarify clarify VB cord-260054-iihgc5nr 148 10 whether whether IN cord-260054-iihgc5nr 148 11 this this DT cord-260054-iihgc5nr 148 12 may may MD cord-260054-iihgc5nr 148 13 be be VB cord-260054-iihgc5nr 148 14 a a DT cord-260054-iihgc5nr 148 15 virus virus NN cord-260054-iihgc5nr 148 16 strategy strategy NN cord-260054-iihgc5nr 148 17 for for IN cord-260054-iihgc5nr 148 18 reducing reduce VBG cord-260054-iihgc5nr 148 19 its -PRON- PRP$ cord-260054-iihgc5nr 148 20 membrane membrane NN cord-260054-iihgc5nr 148 21 fusion fusion NN cord-260054-iihgc5nr 148 22 capacity capacity NN cord-260054-iihgc5nr 148 23 , , , cord-260054-iihgc5nr 148 24 thus thus RB cord-260054-iihgc5nr 148 25 lowering lower VBG cord-260054-iihgc5nr 148 26 its -PRON- PRP$ cord-260054-iihgc5nr 148 27 virulence virulence NN cord-260054-iihgc5nr 148 28 . . . 
cord-260054-iihgc5nr 149 1 We -PRON- PRP cord-260054-iihgc5nr 149 2 gratefully gratefully RB cord-260054-iihgc5nr 149 3 acknowledge acknowledge VBP cord-260054-iihgc5nr 149 4 all all PDT cord-260054-iihgc5nr 149 5 the the DT cord-260054-iihgc5nr 149 6 Authors author NNS cord-260054-iihgc5nr 149 7 from from IN cord-260054-iihgc5nr 149 8 the the DT cord-260054-iihgc5nr 149 9 Originating originate VBG cord-260054-iihgc5nr 149 10 laboratories laboratory NNS cord-260054-iihgc5nr 149 11 responsible responsible JJ cord-260054-iihgc5nr 149 12 for for IN cord-260054-iihgc5nr 149 13 obtaining obtain VBG cord-260054-iihgc5nr 149 14 the the DT cord-260054-iihgc5nr 149 15 specimens specimen NNS cord-260054-iihgc5nr 149 16 and and CC cord-260054-iihgc5nr 149 17 the the DT cord-260054-iihgc5nr 149 18 Submitting submit VBG cord-260054-iihgc5nr 149 19 laboratories laboratory NNS cord-260054-iihgc5nr 149 20 where where WRB cord-260054-iihgc5nr 149 21 genetic genetic JJ cord-260054-iihgc5nr 149 22 sequence sequence NN cord-260054-iihgc5nr 149 23 data datum NNS cord-260054-iihgc5nr 149 24 were be VBD cord-260054-iihgc5nr 149 25 generated generate VBN cord-260054-iihgc5nr 149 26 and and CC cord-260054-iihgc5nr 149 27 shared share VBN cord-260054-iihgc5nr 149 28 via via IN cord-260054-iihgc5nr 149 29 the the DT cord-260054-iihgc5nr 149 30 GISAID GISAID NNP cord-260054-iihgc5nr 149 31 Initiative Initiative NNP cord-260054-iihgc5nr 149 32 , , , cord-260054-iihgc5nr 149 33 on on IN cord-260054-iihgc5nr 149 34 which which WDT cord-260054-iihgc5nr 149 35 this this DT cord-260054-iihgc5nr 149 36 research research NN cord-260054-iihgc5nr 149 37 is be VBZ cord-260054-iihgc5nr 149 38 based base VBN cord-260054-iihgc5nr 149 39 . . . cord-260054-iihgc5nr 150 1 Table table NN cord-260054-iihgc5nr 150 2 2 2 CD cord-260054-iihgc5nr 150 3 . . . 
cord-260054-iihgc5nr 151 1 Solvent solvent NN cord-260054-iihgc5nr 151 2 accessibility accessibility NN cord-260054-iihgc5nr 151 3 of of IN cord-260054-iihgc5nr 151 4 mutated mutated JJ cord-260054-iihgc5nr 151 5 residues residue NNS cord-260054-iihgc5nr 151 6 in in IN cord-260054-iihgc5nr 151 7 the the DT cord-260054-iihgc5nr 151 8 pre pre NN cord-260054-iihgc5nr 151 9 - - JJ cord-260054-iihgc5nr 151 10 and and CC cord-260054-iihgc5nr 151 11 post post JJ cord-260054-iihgc5nr 151 12 - - JJ cord-260054-iihgc5nr 151 13 fusion fusion JJ cord-260054-iihgc5nr 151 14 conformations conformation NNS cord-260054-iihgc5nr 151 15 . . . cord-260054-iihgc5nr 152 1 Amino amino NN cord-260054-iihgc5nr 152 2 acid acid NN cord-260054-iihgc5nr 152 3 Pre pre JJ cord-260054-iihgc5nr 152 4 - - JJ cord-260054-iihgc5nr 152 5 fusion fusion JJ cord-260054-iihgc5nr 152 6 Post Post NNP cord-260054-iihgc5nr 152 7 - - NN cord-260054-iihgc5nr 152 8 fusion fusion NN cord-260054-iihgc5nr 152 9 I929 I929 '' cord-260054-iihgc5nr 152 10 exposed expose VBN cord-260054-iihgc5nr 152 11 partly partly RB cord-260054-iihgc5nr 152 12 buried bury VBN cord-260054-iihgc5nr 152 13 ( ( -LRB- cord-260054-iihgc5nr 152 14 18.6 18.6 CD cord-260054-iihgc5nr 152 15 % % NN cord-260054-iihgc5nr 152 16 ) ) -RRB- cord-260054-iihgc5nr 153 1 a a DT cord-260054-iihgc5nr 153 2 Y936 y936 CD cord-260054-iihgc5nr 153 3 exposed expose VBN cord-260054-iihgc5nr 153 4 partly partly RB cord-260054-iihgc5nr 153 5 buried bury VBN cord-260054-iihgc5nr 153 6 ( ( -LRB- cord-260054-iihgc5nr 153 7 19.0 19.0 CD cord-260054-iihgc5nr 153 8 % % NN cord-260054-iihgc5nr 154 1 ) ) -RRB- cord-260054-iihgc5nr 154 2 F938 f938 CD cord-260054-iihgc5nr 154 3 buried bury VBN cord-260054-iihgc5nr 154 4 ( ( -LRB- cord-260054-iihgc5nr 154 5 95.8 95.8 CD cord-260054-iihgc5nr 154 6 % % NN cord-260054-iihgc5nr 154 7 ) ) -RRB- cord-260054-iihgc5nr 154 8 buried bury VBN cord-260054-iihgc5nr 154 9 ( ( -LRB- cord-260054-iihgc5nr 154 10 95.3 95.3 CD 
cord-260054-iihgc5nr 154 11 % % NN cord-260054-iihgc5nr 154 12 ) ) -RRB- cord-260054-iihgc5nr 155 1 F939 F939 NNS cord-260054-iihgc5nr 155 2 exposed expose VBD cord-260054-iihgc5nr 155 3 exposed expose VBN cord-260054-iihgc5nr 155 4 F940 f940 CD cord-260054-iihgc5nr 155 5 exposed expose VBN cord-260054-iihgc5nr 155 6 buried bury VBN cord-260054-iihgc5nr 155 7 ( ( -LRB- cord-260054-iihgc5nr 155 8 62.3 62.3 CD cord-260054-iihgc5nr 155 9 % % NN cord-260054-iihgc5nr 155 10 ) ) -RRB- cord-260054-iihgc5nr 155 11 Table table NN cord-260054-iihgc5nr 155 12 S1 s1 NN cord-260054-iihgc5nr 155 13 . . . cord-260054-iihgc5nr 156 1 List list NN cord-260054-iihgc5nr 156 2 of of IN cord-260054-iihgc5nr 156 3 mutations mutation NNS cord-260054-iihgc5nr 156 4 identified identify VBN cord-260054-iihgc5nr 156 5 in in IN cord-260054-iihgc5nr 156 6 GISAID GISAID NNP cord-260054-iihgc5nr 156 7 in in IN cord-260054-iihgc5nr 156 8 at at RB cord-260054-iihgc5nr 156 9 least least JJS cord-260054-iihgc5nr 156 10 2 2 CD cord-260054-iihgc5nr 156 11 identical identical JJ cord-260054-iihgc5nr 156 12 sequences sequence NNS cord-260054-iihgc5nr 156 13 on on IN cord-260054-iihgc5nr 156 14 April April NNP cord-260054-iihgc5nr 156 15 21 21 CD cord-260054-iihgc5nr 156 16 st st NNP cord-260054-iihgc5nr 156 17 2020 2020 CD cord-260054-iihgc5nr 156 18 , , , cord-260054-iihgc5nr 156 19 in in IN cord-260054-iihgc5nr 156 20 sequential sequential JJ cord-260054-iihgc5nr 156 21 order order NN cord-260054-iihgc5nr 156 22 . . . 
cord-260054-iihgc5nr 157 1 A a DT cord-260054-iihgc5nr 157 2 pneumonia pneumonia NN cord-260054-iihgc5nr 157 3 outbreak outbreak NN cord-260054-iihgc5nr 157 4 associated associate VBN cord-260054-iihgc5nr 157 5 with with IN cord-260054-iihgc5nr 157 6 a a DT cord-260054-iihgc5nr 157 7 new new JJ cord-260054-iihgc5nr 157 8 coronavirus coronavirus NN cord-260054-iihgc5nr 157 9 of of IN cord-260054-iihgc5nr 157 10 probable probable JJ cord-260054-iihgc5nr 157 11 bat bat NN cord-260054-iihgc5nr 157 12 origin origin NN cord-260054-iihgc5nr 158 1 A a DT cord-260054-iihgc5nr 158 2 new new JJ cord-260054-iihgc5nr 158 3 coronavirus coronavirus NN cord-260054-iihgc5nr 158 4 associated associate VBN cord-260054-iihgc5nr 158 5 with with IN cord-260054-iihgc5nr 158 6 human human JJ cord-260054-iihgc5nr 158 7 respiratory respiratory JJ cord-260054-iihgc5nr 158 8 disease disease NN cord-260054-iihgc5nr 158 9 in in IN cord-260054-iihgc5nr 158 10 China China NNP cord-260054-iihgc5nr 158 11 Clinical clinical JJ cord-260054-iihgc5nr 158 12 features feature NNS cord-260054-iihgc5nr 158 13 of of IN cord-260054-iihgc5nr 158 14 patients patient NNS cord-260054-iihgc5nr 158 15 infected infect VBN cord-260054-iihgc5nr 158 16 with with IN cord-260054-iihgc5nr 158 17 2019 2019 CD cord-260054-iihgc5nr 158 18 novel novel JJ cord-260054-iihgc5nr 158 19 coronavirus coronavirus NN cord-260054-iihgc5nr 158 20 in in IN cord-260054-iihgc5nr 158 21 Wuhan Wuhan NNP cord-260054-iihgc5nr 158 22 Receptor Receptor NNP cord-260054-iihgc5nr 158 23 recognition recognition NN cord-260054-iihgc5nr 158 24 mechanisms mechanism NNS cord-260054-iihgc5nr 158 25 of of IN cord-260054-iihgc5nr 158 26 coronaviruses coronaviruse NNS cord-260054-iihgc5nr 158 27 : : : cord-260054-iihgc5nr 158 28 a a DT cord-260054-iihgc5nr 158 29 decade decade NN cord-260054-iihgc5nr 158 30 of of IN cord-260054-iihgc5nr 158 31 structural structural JJ cord-260054-iihgc5nr 158 32 studies study NNS cord-260054-iihgc5nr 158 33 Activation 
Activation NNP cord-260054-iihgc5nr 158 34 of of IN cord-260054-iihgc5nr 158 35 the the DT cord-260054-iihgc5nr 158 36 SARS SARS NNP cord-260054-iihgc5nr 158 37 coronavirus coronavirus NN cord-260054-iihgc5nr 158 38 spike spike NN cord-260054-iihgc5nr 158 39 protein protein NN cord-260054-iihgc5nr 158 40 via via IN cord-260054-iihgc5nr 158 41 sequential sequential JJ cord-260054-iihgc5nr 158 42 proteolytic proteolytic JJ cord-260054-iihgc5nr 158 43 cleavage cleavage NN cord-260054-iihgc5nr 158 44 at at IN cord-260054-iihgc5nr 158 45 two two CD cord-260054-iihgc5nr 158 46 distinct distinct JJ cord-260054-iihgc5nr 158 47 sites site NNS cord-260054-iihgc5nr 158 48 Host host NN cord-260054-iihgc5nr 158 49 cell cell NN cord-260054-iihgc5nr 158 50 entry entry NN cord-260054-iihgc5nr 158 51 of of IN cord-260054-iihgc5nr 158 52 Middle Middle NNP cord-260054-iihgc5nr 158 53 East East NNP cord-260054-iihgc5nr 158 54 respiratory respiratory JJ cord-260054-iihgc5nr 158 55 syndrome syndrome NN cord-260054-iihgc5nr 158 56 coronavirus coronavirus NN cord-260054-iihgc5nr 158 57 after after IN cord-260054-iihgc5nr 158 58 two two CD cord-260054-iihgc5nr 158 59 - - HYPH cord-260054-iihgc5nr 158 60 step step NN cord-260054-iihgc5nr 158 61 , , , cord-260054-iihgc5nr 158 62 furin furin NN cord-260054-iihgc5nr 158 63 - - HYPH cord-260054-iihgc5nr 158 64 mediated mediate VBN cord-260054-iihgc5nr 158 65 activation activation NN cord-260054-iihgc5nr 158 66 of of IN cord-260054-iihgc5nr 158 67 the the DT cord-260054-iihgc5nr 158 68 spike spike NN cord-260054-iihgc5nr 158 69 protein protein NN cord-260054-iihgc5nr 158 70 Cell cell NN cord-260054-iihgc5nr 158 71 entry entry NN cord-260054-iihgc5nr 158 72 mechanisms mechanism NNS cord-260054-iihgc5nr 158 73 of of IN cord-260054-iihgc5nr 158 74 SARS SARS NNP cord-260054-iihgc5nr 158 75 - - HYPH cord-260054-iihgc5nr 158 76 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 158 77 SARS SARS NNP cord-260054-iihgc5nr 158 78 - - HYPH cord-260054-iihgc5nr 158 79 
CoV-2 CoV-2 NNP cord-260054-iihgc5nr 158 80 SPIKE spike NN cord-260054-iihgc5nr 158 81 PROTEIN protein NN cord-260054-iihgc5nr 158 82 : : : cord-260054-iihgc5nr 158 83 an an DT cord-260054-iihgc5nr 158 84 optimal optimal JJ cord-260054-iihgc5nr 158 85 immunological immunological JJ cord-260054-iihgc5nr 158 86 target target NN cord-260054-iihgc5nr 158 87 for for IN cord-260054-iihgc5nr 158 88 vaccines vaccine NNS cord-260054-iihgc5nr 158 89 The the DT cord-260054-iihgc5nr 158 90 spike spike NN cord-260054-iihgc5nr 158 91 protein protein NN cord-260054-iihgc5nr 158 92 of of IN cord-260054-iihgc5nr 158 93 SARS SARS NNP cord-260054-iihgc5nr 158 94 - - HYPH cord-260054-iihgc5nr 158 95 CoV CoV NNP cord-260054-iihgc5nr 158 96 -- -- : cord-260054-iihgc5nr 158 97 a a DT cord-260054-iihgc5nr 158 98 target target NN cord-260054-iihgc5nr 158 99 for for IN cord-260054-iihgc5nr 158 100 vaccine vaccine NN cord-260054-iihgc5nr 158 101 and and CC cord-260054-iihgc5nr 158 102 therapeutic therapeutic JJ cord-260054-iihgc5nr 158 103 development development NN cord-260054-iihgc5nr 158 104 Key key JJ cord-260054-iihgc5nr 158 105 residues residue NNS cord-260054-iihgc5nr 158 106 of of IN cord-260054-iihgc5nr 158 107 the the DT cord-260054-iihgc5nr 158 108 receptor receptor NN cord-260054-iihgc5nr 158 109 binding bind VBG cord-260054-iihgc5nr 158 110 motif motif NN cord-260054-iihgc5nr 158 111 in in IN cord-260054-iihgc5nr 158 112 the the DT cord-260054-iihgc5nr 158 113 spike spike NN cord-260054-iihgc5nr 158 114 protein protein NN cord-260054-iihgc5nr 158 115 of of IN cord-260054-iihgc5nr 158 116 SARS SARS NNP cord-260054-iihgc5nr 158 117 - - : cord-260054-iihgc5nr 158 118 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 158 119 that that WDT cord-260054-iihgc5nr 158 120 interact interact VBP cord-260054-iihgc5nr 158 121 with with IN cord-260054-iihgc5nr 158 122 ACE2 ACE2 NNP cord-260054-iihgc5nr 158 123 and and CC cord-260054-iihgc5nr 158 124 neutralizing neutralize VBG cord-260054-iihgc5nr 158 125 
antibodies antibody NNS cord-260054-iihgc5nr 159 1 Potent potent JJ cord-260054-iihgc5nr 159 2 binding binding NN cord-260054-iihgc5nr 159 3 of of IN cord-260054-iihgc5nr 159 4 2019 2019 CD cord-260054-iihgc5nr 159 5 novel novel JJ cord-260054-iihgc5nr 159 6 coronavirus coronavirus NN cord-260054-iihgc5nr 159 7 spike spike NN cord-260054-iihgc5nr 159 8 protein protein NN cord-260054-iihgc5nr 159 9 by by IN cord-260054-iihgc5nr 159 10 a a DT cord-260054-iihgc5nr 159 11 SARS SARS NNP cord-260054-iihgc5nr 159 12 coronavirus coronavirus NN cord-260054-iihgc5nr 159 13 - - HYPH cord-260054-iihgc5nr 159 14 specific specific JJ cord-260054-iihgc5nr 159 15 human human JJ cord-260054-iihgc5nr 159 16 monoclonal monoclonal JJ cord-260054-iihgc5nr 159 17 antibody antibody NN cord-260054-iihgc5nr 159 18 2020 2020 CD cord-260054-iihgc5nr 159 19 ) ) -RRB- cord-260054-iihgc5nr 160 1 A a DT cord-260054-iihgc5nr 160 2 human human JJ cord-260054-iihgc5nr 160 3 monoclonal monoclonal JJ cord-260054-iihgc5nr 160 4 antibody antibody NN cord-260054-iihgc5nr 160 5 blocking block VBG cord-260054-iihgc5nr 160 6 SARS SARS NNP cord-260054-iihgc5nr 160 7 - - HYPH cord-260054-iihgc5nr 160 8 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 160 9 infection infection NN cord-260054-iihgc5nr 160 10 Candidate candidate NN cord-260054-iihgc5nr 160 11 drugs drug NNS cord-260054-iihgc5nr 160 12 against against IN cord-260054-iihgc5nr 160 13 SARS SARS NNP cord-260054-iihgc5nr 160 14 - - HYPH cord-260054-iihgc5nr 160 15 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 160 16 and and CC cord-260054-iihgc5nr 160 17 COVID-19 COVID-19 NNP cord-260054-iihgc5nr 160 18 SARS SARS NNP cord-260054-iihgc5nr 160 19 - - HYPH cord-260054-iihgc5nr 160 20 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 160 21 Vaccines vaccine NNS cord-260054-iihgc5nr 160 22 : : : cord-260054-iihgc5nr 160 23 Status Status NNP cord-260054-iihgc5nr 160 24 Report Report NNP cord-260054-iihgc5nr 160 25 . . . 
cord-260054-iihgc5nr 161 1 Immunity Immunity NNP cord-260054-iihgc5nr 161 2 Ready Ready NNP cord-260054-iihgc5nr 161 3 , , , cord-260054-iihgc5nr 161 4 set set VBN cord-260054-iihgc5nr 161 5 , , , cord-260054-iihgc5nr 161 6 fuse fuse NN cord-260054-iihgc5nr 161 7 ! ! . cord-260054-iihgc5nr 162 1 The the DT cord-260054-iihgc5nr 162 2 coronavirus coronavirus NN cord-260054-iihgc5nr 162 3 spike spike NN cord-260054-iihgc5nr 162 4 protein protein NN cord-260054-iihgc5nr 162 5 and and CC cord-260054-iihgc5nr 162 6 acquisition acquisition NN cord-260054-iihgc5nr 162 7 of of IN cord-260054-iihgc5nr 162 8 fusion fusion NN cord-260054-iihgc5nr 162 9 competence competence NN cord-260054-iihgc5nr 162 10 Structure Structure NNP cord-260054-iihgc5nr 162 11 of of IN cord-260054-iihgc5nr 162 12 SARS SARS NNP cord-260054-iihgc5nr 162 13 coronavirus coronavirus NN cord-260054-iihgc5nr 162 14 spike spike NN cord-260054-iihgc5nr 162 15 receptor receptor NN cord-260054-iihgc5nr 162 16 - - HYPH cord-260054-iihgc5nr 162 17 binding bind VBG cord-260054-iihgc5nr 162 18 domain domain NN cord-260054-iihgc5nr 162 19 complexed complexe VBN cord-260054-iihgc5nr 162 20 with with IN cord-260054-iihgc5nr 162 21 receptor receptor NN cord-260054-iihgc5nr 162 22 Structure Structure NNP cord-260054-iihgc5nr 162 23 of of IN cord-260054-iihgc5nr 162 24 MERS MERS NNP cord-260054-iihgc5nr 162 25 - - HYPH cord-260054-iihgc5nr 162 26 CoV CoV NNP cord-260054-iihgc5nr 162 27 spike spike NN cord-260054-iihgc5nr 162 28 receptor receptor NN cord-260054-iihgc5nr 162 29 - - HYPH cord-260054-iihgc5nr 162 30 binding bind VBG cord-260054-iihgc5nr 162 31 domain domain NN cord-260054-iihgc5nr 162 32 complexed complexe VBN cord-260054-iihgc5nr 162 33 with with IN cord-260054-iihgc5nr 162 34 human human JJ cord-260054-iihgc5nr 162 35 receptor receptor NN cord-260054-iihgc5nr 162 36 DPP4 DPP4 NNP cord-260054-iihgc5nr 162 37 Tectonic tectonic JJ cord-260054-iihgc5nr 162 38 conformational conformational JJ 
cord-260054-iihgc5nr 162 39 changes change NNS cord-260054-iihgc5nr 162 40 of of IN cord-260054-iihgc5nr 162 41 a a DT cord-260054-iihgc5nr 162 42 coronavirus coronavirus NN cord-260054-iihgc5nr 162 43 spike spike NN cord-260054-iihgc5nr 162 44 glycoprotein glycoprotein NN cord-260054-iihgc5nr 162 45 promote promote VBP cord-260054-iihgc5nr 162 46 membrane membrane NN cord-260054-iihgc5nr 162 47 fusion fusion NN cord-260054-iihgc5nr 162 48 2017 2017 CD cord-260054-iihgc5nr 162 49 ) ) -RRB- cord-260054-iihgc5nr 163 1 Data datum NNS cord-260054-iihgc5nr 163 2 , , , cord-260054-iihgc5nr 163 3 disease disease NN cord-260054-iihgc5nr 163 4 and and CC cord-260054-iihgc5nr 163 5 diplomacy diplomacy NN cord-260054-iihgc5nr 163 6 : : : cord-260054-iihgc5nr 163 7 GISAID GISAID NNP cord-260054-iihgc5nr 163 8 's 's POS cord-260054-iihgc5nr 163 9 innovative innovative JJ cord-260054-iihgc5nr 163 10 contribution contribution NN cord-260054-iihgc5nr 163 11 to to IN cord-260054-iihgc5nr 163 12 global global JJ cord-260054-iihgc5nr 163 13 health health NN cord-260054-iihgc5nr 163 14 GISAID GISAID NNP cord-260054-iihgc5nr 163 15 : : : cord-260054-iihgc5nr 164 1 Global global JJ cord-260054-iihgc5nr 164 2 initiative initiative NN cord-260054-iihgc5nr 164 3 on on IN cord-260054-iihgc5nr 164 4 sharing share VBG cord-260054-iihgc5nr 164 5 all all DT cord-260054-iihgc5nr 164 6 influenza influenza NN cord-260054-iihgc5nr 164 7 data datum NNS cord-260054-iihgc5nr 165 1 -from -from JJ cord-260054-iihgc5nr 165 2 vision vision NN cord-260054-iihgc5nr 165 3 to to IN cord-260054-iihgc5nr 165 4 reality reality NN cord-260054-iihgc5nr 166 1 The the DT cord-260054-iihgc5nr 166 2 Protein Protein NNP cord-260054-iihgc5nr 166 3 Data Data NNP cord-260054-iihgc5nr 166 4 Bank Bank NNP cord-260054-iihgc5nr 166 5 Cryo Cryo NNP cord-260054-iihgc5nr 166 6 - - HYPH cord-260054-iihgc5nr 166 7 EM EM NNP cord-260054-iihgc5nr 166 8 structure structure NN cord-260054-iihgc5nr 166 9 of of IN cord-260054-iihgc5nr 
166 10 the the DT cord-260054-iihgc5nr 166 11 2019-nCoV 2019-ncov CD cord-260054-iihgc5nr 166 12 spike spike NN cord-260054-iihgc5nr 166 13 in in IN cord-260054-iihgc5nr 166 14 the the DT cord-260054-iihgc5nr 166 15 prefusion prefusion NN cord-260054-iihgc5nr 166 16 conformation conformation NNP cord-260054-iihgc5nr 166 17 2020 2020 CD cord-260054-iihgc5nr 166 18 ) ) -RRB- cord-260054-iihgc5nr 167 1 Structure structure NN cord-260054-iihgc5nr 167 2 , , , cord-260054-iihgc5nr 167 3 Function Function NNP cord-260054-iihgc5nr 167 4 , , , cord-260054-iihgc5nr 167 5 and and CC cord-260054-iihgc5nr 167 6 Antigenicity Antigenicity NNP cord-260054-iihgc5nr 167 7 of of IN cord-260054-iihgc5nr 167 8 the the DT cord-260054-iihgc5nr 167 9 SARS SARS NNP cord-260054-iihgc5nr 167 10 - - HYPH cord-260054-iihgc5nr 167 11 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 167 12 Spike Spike NNP cord-260054-iihgc5nr 167 13 Glycoprotein Glycoprotein NNP cord-260054-iihgc5nr 167 14 Structure Structure NNP cord-260054-iihgc5nr 167 15 of of IN cord-260054-iihgc5nr 167 16 the the DT cord-260054-iihgc5nr 167 17 SARS SARS NNP cord-260054-iihgc5nr 167 18 - - HYPH cord-260054-iihgc5nr 167 19 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 167 20 spike spike NN cord-260054-iihgc5nr 167 21 receptorbinding receptorbinding NN cord-260054-iihgc5nr 167 22 domain domain NN cord-260054-iihgc5nr 167 23 bound bind VBN cord-260054-iihgc5nr 167 24 to to IN cord-260054-iihgc5nr 167 25 the the DT cord-260054-iihgc5nr 167 26 ACE2 ACE2 NNP cord-260054-iihgc5nr 167 27 receptor receptor NN cord-260054-iihgc5nr 167 28 Structural structural JJ cord-260054-iihgc5nr 167 29 basis basis NN cord-260054-iihgc5nr 167 30 for for IN cord-260054-iihgc5nr 167 31 the the DT cord-260054-iihgc5nr 167 32 recognition recognition NN cord-260054-iihgc5nr 167 33 of of IN cord-260054-iihgc5nr 167 34 SARS SARS NNP cord-260054-iihgc5nr 167 35 - - HYPH cord-260054-iihgc5nr 167 36 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 167 37 by by IN cord-260054-iihgc5nr 167 38 
full full JJ cord-260054-iihgc5nr 167 39 - - HYPH cord-260054-iihgc5nr 167 40 length length NN cord-260054-iihgc5nr 167 41 human human JJ cord-260054-iihgc5nr 167 42 ACE2 ACE2 NNP cord-260054-iihgc5nr 167 43 Structural Structural NNP cord-260054-iihgc5nr 167 44 and and CC cord-260054-iihgc5nr 167 45 Functional Functional NNP cord-260054-iihgc5nr 167 46 Basis Basis NNP cord-260054-iihgc5nr 167 47 of of IN cord-260054-iihgc5nr 167 48 SARS SARS NNP cord-260054-iihgc5nr 167 49 - - HYPH cord-260054-iihgc5nr 167 50 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 167 51 Entry Entry NNP cord-260054-iihgc5nr 167 52 by by IN cord-260054-iihgc5nr 167 53 Using use VBG cord-260054-iihgc5nr 167 54 Human Human NNP cord-260054-iihgc5nr 167 55 ACE2 ACE2 NNP cord-260054-iihgc5nr 167 56 2020 2020 CD cord-260054-iihgc5nr 167 57 ) ) -RRB- cord-260054-iihgc5nr 168 1 Structural structural JJ cord-260054-iihgc5nr 168 2 basis basis NN cord-260054-iihgc5nr 168 3 of of IN cord-260054-iihgc5nr 168 4 receptor receptor NN cord-260054-iihgc5nr 168 5 recognition recognition NN cord-260054-iihgc5nr 168 6 by by IN cord-260054-iihgc5nr 168 7 SARS SARS NNP cord-260054-iihgc5nr 168 8 - - HYPH cord-260054-iihgc5nr 168 9 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 168 10 A a DT cord-260054-iihgc5nr 168 11 highly highly RB cord-260054-iihgc5nr 168 12 conserved conserved JJ cord-260054-iihgc5nr 168 13 cryptic cryptic JJ cord-260054-iihgc5nr 168 14 epitope epitope NN cord-260054-iihgc5nr 168 15 in in IN cord-260054-iihgc5nr 168 16 the the DT cord-260054-iihgc5nr 168 17 receptor receptor NN cord-260054-iihgc5nr 168 18 binding bind VBG cord-260054-iihgc5nr 168 19 domains domain NNS cord-260054-iihgc5nr 168 20 of of IN cord-260054-iihgc5nr 168 21 SARS SARS NNP cord-260054-iihgc5nr 168 22 - - HYPH cord-260054-iihgc5nr 168 23 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 168 24 and and CC cord-260054-iihgc5nr 168 25 SARS SARS NNP cord-260054-iihgc5nr 168 26 - - HYPH cord-260054-iihgc5nr 168 27 CoV CoV NNP cord-260054-iihgc5nr 168 28 
Inhibition Inhibition NNP cord-260054-iihgc5nr 168 29 of of IN cord-260054-iihgc5nr 168 30 SARS SARS NNP cord-260054-iihgc5nr 168 31 - - HYPH cord-260054-iihgc5nr 168 32 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 168 33 ( ( -LRB- cord-260054-iihgc5nr 168 34 previously previously RB cord-260054-iihgc5nr 168 35 2019-nCoV 2019-ncov CD cord-260054-iihgc5nr 168 36 ) ) -RRB- cord-260054-iihgc5nr 168 37 infection infection NN cord-260054-iihgc5nr 168 38 by by IN cord-260054-iihgc5nr 168 39 a a DT cord-260054-iihgc5nr 168 40 highly highly RB cord-260054-iihgc5nr 168 41 potent potent JJ cord-260054-iihgc5nr 168 42 pan pan JJ cord-260054-iihgc5nr 168 43 - - JJ cord-260054-iihgc5nr 168 44 coronavirus coronavirus NN cord-260054-iihgc5nr 168 45 fusion fusion NN cord-260054-iihgc5nr 168 46 inhibitor inhibitor NN cord-260054-iihgc5nr 168 47 targeting target VBG cord-260054-iihgc5nr 168 48 its -PRON- PRP$ cord-260054-iihgc5nr 168 49 spike spike NN cord-260054-iihgc5nr 168 50 protein protein NN cord-260054-iihgc5nr 168 51 that that WDT cord-260054-iihgc5nr 168 52 harbors harbor VBZ cord-260054-iihgc5nr 168 53 a a DT cord-260054-iihgc5nr 168 54 high high JJ cord-260054-iihgc5nr 168 55 capacity capacity NN cord-260054-iihgc5nr 168 56 to to TO cord-260054-iihgc5nr 168 57 mediate mediate VB cord-260054-iihgc5nr 168 58 membrane membrane NN cord-260054-iihgc5nr 168 59 fusion fusion NN cord-260054-iihgc5nr 168 60 Identification identification NN cord-260054-iihgc5nr 168 61 of of IN cord-260054-iihgc5nr 168 62 a a DT cord-260054-iihgc5nr 168 63 novel novel JJ cord-260054-iihgc5nr 168 64 coronavirus coronavirus NN cord-260054-iihgc5nr 168 65 causing cause VBG cord-260054-iihgc5nr 168 66 severe severe JJ cord-260054-iihgc5nr 168 67 pneumonia pneumonia NN cord-260054-iihgc5nr 168 68 in in IN cord-260054-iihgc5nr 168 69 human human NN cord-260054-iihgc5nr 168 70 : : : cord-260054-iihgc5nr 168 71 a a DT cord-260054-iihgc5nr 168 72 descriptive descriptive JJ cord-260054-iihgc5nr 168 73 study study NN 
cord-260054-iihgc5nr 168 74 Fusion fusion NN cord-260054-iihgc5nr 168 75 mechanism mechanism NN cord-260054-iihgc5nr 168 76 of of IN cord-260054-iihgc5nr 168 77 2019-nCoV 2019-ncov CD cord-260054-iihgc5nr 168 78 and and CC cord-260054-iihgc5nr 168 79 fusion fusion NN cord-260054-iihgc5nr 168 80 inhibitors inhibitor NNS cord-260054-iihgc5nr 168 81 targeting target VBG cord-260054-iihgc5nr 168 82 HR1 HR1 NNP cord-260054-iihgc5nr 168 83 domain domain NN cord-260054-iihgc5nr 168 84 in in IN cord-260054-iihgc5nr 168 85 spike spike NN cord-260054-iihgc5nr 168 86 protein protein NN cord-260054-iihgc5nr 168 87 CD cd NN cord-260054-iihgc5nr 168 88 - - HYPH cord-260054-iihgc5nr 168 89 HIT HIT NNP cord-260054-iihgc5nr 168 90 : : : cord-260054-iihgc5nr 168 91 accelerated accelerate VBN cord-260054-iihgc5nr 168 92 for for IN cord-260054-iihgc5nr 168 93 clustering cluster VBG cord-260054-iihgc5nr 168 94 the the DT cord-260054-iihgc5nr 168 95 next next JJ cord-260054-iihgc5nr 168 96 - - HYPH cord-260054-iihgc5nr 168 97 generation generation NN cord-260054-iihgc5nr 168 98 sequencing sequencing NN cord-260054-iihgc5nr 168 99 data datum NNS cord-260054-iihgc5nr 168 100 Comparative Comparative NNP cord-260054-iihgc5nr 168 101 protein protein NN cord-260054-iihgc5nr 168 102 modelling modelling NN cord-260054-iihgc5nr 168 103 by by IN cord-260054-iihgc5nr 168 104 satisfaction satisfaction NN cord-260054-iihgc5nr 168 105 of of IN cord-260054-iihgc5nr 168 106 spatial spatial JJ cord-260054-iihgc5nr 168 107 restraints restraint NNS cord-260054-iihgc5nr 168 108 Modeling model VBG cord-260054-iihgc5nr 168 109 mutations mutation NNS cord-260054-iihgc5nr 168 110 in in IN cord-260054-iihgc5nr 168 111 protein protein NN cord-260054-iihgc5nr 168 112 structures structure NNS cord-260054-iihgc5nr 168 113 COCOMAPS COCOMAPS NNP cord-260054-iihgc5nr 168 114 : : : cord-260054-iihgc5nr 168 115 a a DT cord-260054-iihgc5nr 168 116 web web NN cord-260054-iihgc5nr 168 117 application application NN 
cord-260054-iihgc5nr 168 118 to to TO cord-260054-iihgc5nr 168 119 analyze analyze VB cord-260054-iihgc5nr 168 120 and and CC cord-260054-iihgc5nr 168 121 visualize visualize VB cord-260054-iihgc5nr 168 122 contacts contact NNS cord-260054-iihgc5nr 168 123 at at IN cord-260054-iihgc5nr 168 124 the the DT cord-260054-iihgc5nr 168 125 interface interface NN cord-260054-iihgc5nr 168 126 of of IN cord-260054-iihgc5nr 168 127 biomolecular biomolecular JJ cord-260054-iihgc5nr 168 128 complexes complex NNS cord-260054-iihgc5nr 169 1 Could Could MD cord-260054-iihgc5nr 169 2 the the DT cord-260054-iihgc5nr 169 3 D614 d614 NN cord-260054-iihgc5nr 169 4 G g NN cord-260054-iihgc5nr 169 5 substitution substitution NN cord-260054-iihgc5nr 169 6 in in IN cord-260054-iihgc5nr 169 7 the the DT cord-260054-iihgc5nr 169 8 SARS SARS NNP cord-260054-iihgc5nr 169 9 - - HYPH cord-260054-iihgc5nr 169 10 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 169 11 spike spike NN cord-260054-iihgc5nr 169 12 ( ( -LRB- cord-260054-iihgc5nr 169 13 S s NN cord-260054-iihgc5nr 169 14 ) ) -RRB- cord-260054-iihgc5nr 169 15 protein protein NN cord-260054-iihgc5nr 169 16 be be VB cord-260054-iihgc5nr 169 17 associated associate VBN cord-260054-iihgc5nr 169 18 with with IN cord-260054-iihgc5nr 169 19 higher high JJR cord-260054-iihgc5nr 169 20 COVID-19 COVID-19 NNP cord-260054-iihgc5nr 169 21 mortality mortality NN cord-260054-iihgc5nr 169 22 ? ? . 
cord-260054-iihgc5nr 170 1 Spike spike NN cord-260054-iihgc5nr 170 2 mutation mutation NN cord-260054-iihgc5nr 170 3 pipeline pipeline NN cord-260054-iihgc5nr 170 4 reveals reveal VBZ cord-260054-iihgc5nr 170 5 the the DT cord-260054-iihgc5nr 170 6 emergence emergence NN cord-260054-iihgc5nr 170 7 of of IN cord-260054-iihgc5nr 170 8 a a DT cord-260054-iihgc5nr 170 9 more more RBR cord-260054-iihgc5nr 170 10 transmissible transmissible JJ cord-260054-iihgc5nr 170 11 form form NN cord-260054-iihgc5nr 170 12 of of IN cord-260054-iihgc5nr 170 13 SARS SARS NNP cord-260054-iihgc5nr 170 14 - - HYPH cord-260054-iihgc5nr 170 15 CoV-2 CoV-2 NNP cord-260054-iihgc5nr 170 16 . . . cord-260054-iihgc5nr 171 1 bioRxiv biorxiv RB cord-260054-iihgc5nr 172 1 Structure structure NN cord-260054-iihgc5nr 172 2 validation validation NN cord-260054-iihgc5nr 172 3 by by IN cord-260054-iihgc5nr 172 4 Calpha Calpha NNP cord-260054-iihgc5nr 172 5 geometry geometry NN cord-260054-iihgc5nr 172 6 : : : cord-260054-iihgc5nr 172 7 phi phi NNP cord-260054-iihgc5nr 172 8 , , , cord-260054-iihgc5nr 172 9 psi psi NNP cord-260054-iihgc5nr 172 10 and and CC cord-260054-iihgc5nr 172 11 Cbeta Cbeta NNP cord-260054-iihgc5nr 172 12 deviation deviation NN cord-260054-iihgc5nr 172 13 Influence influence NN cord-260054-iihgc5nr 172 14 of of IN cord-260054-iihgc5nr 172 15 proline proline NN cord-260054-iihgc5nr 172 16 residues residue NNS cord-260054-iihgc5nr 172 17 on on IN cord-260054-iihgc5nr 172 18 protein protein NN cord-260054-iihgc5nr 172 19 conformation conformation NN cord-260054-iihgc5nr 172 20 pH ph NN cord-260054-iihgc5nr 172 21 - - HYPH cord-260054-iihgc5nr 172 22 induced induce VBN cord-260054-iihgc5nr 172 23 denaturation denaturation NN cord-260054-iihgc5nr 172 24 of of IN cord-260054-iihgc5nr 172 25 proteins protein NNS cord-260054-iihgc5nr 173 1 : : : cord-260054-iihgc5nr 173 2 a a DT cord-260054-iihgc5nr 173 3 single single JJ cord-260054-iihgc5nr 173 4 salt salt NN cord-260054-iihgc5nr 173 5 
bridge bridge NN cord-260054-iihgc5nr 173 6 contributes contribute VBZ cord-260054-iihgc5nr 173 7 3 3 CD cord-260054-iihgc5nr 173 8 - - SYM cord-260054-iihgc5nr 173 9 5 5 CD cord-260054-iihgc5nr 173 10 kcal kcal NNP cord-260054-iihgc5nr 173 11 / / SYM cord-260054-iihgc5nr 173 12 mol mol NN cord-260054-iihgc5nr 173 13 to to IN cord-260054-iihgc5nr 173 14 the the DT cord-260054-iihgc5nr 173 15 free free JJ cord-260054-iihgc5nr 173 16 energy energy NN cord-260054-iihgc5nr 173 17 of of IN cord-260054-iihgc5nr 173 18 folding folding NN cord-260054-iihgc5nr 173 19 of of IN cord-260054-iihgc5nr 173 20 T4 T4 NNP cord-260054-iihgc5nr 173 21 lysozyme lysozyme NNP cord-260054-iihgc5nr 174 1 Authors author NNS cord-260054-iihgc5nr 174 2 declare declare VBP cord-260054-iihgc5nr 174 3 no no DT cord-260054-iihgc5nr 174 4 competing compete VBG cord-260054-iihgc5nr 174 5 interests interest NNS cord-260054-iihgc5nr 174 6 . . . cord-260054-iihgc5nr 174 7 _SP