id sid tid token lemma pos cord-305054-4d84b2g6 1 1 key key JJ cord-305054-4d84b2g6 1 2 : : : cord-305054-4d84b2g6 1 3 cord-305054 cord-305054 NNP cord-305054-4d84b2g6 1 4 - - HYPH cord-305054-4d84b2g6 1 5 4d84b2g6 4d84b2g6 CD cord-305054-4d84b2g6 1 6 authors author NNS cord-305054-4d84b2g6 1 7 : : : cord-305054-4d84b2g6 1 8 Liu Liu NNP cord-305054-4d84b2g6 1 9 , , , cord-305054-4d84b2g6 1 10 Yuan Yuan NNP cord-305054-4d84b2g6 1 11 ; ; : cord-305054-4d84b2g6 2 1 Yan Yan NNP cord-305054-4d84b2g6 2 2 , , , cord-305054-4d84b2g6 2 3 Changhui Changhui NNP cord-305054-4d84b2g6 2 4 title title NN cord-305054-4d84b2g6 2 5 : : : cord-305054-4d84b2g6 2 6 The the DT cord-305054-4d84b2g6 2 7 selection selection NN cord-305054-4d84b2g6 2 8 of of IN cord-305054-4d84b2g6 2 9 reference reference NN cord-305054-4d84b2g6 2 10 genome genome NN cord-305054-4d84b2g6 2 11 and and CC cord-305054-4d84b2g6 2 12 the the DT cord-305054-4d84b2g6 2 13 search search NN cord-305054-4d84b2g6 2 14 for for IN cord-305054-4d84b2g6 2 15 the the DT cord-305054-4d84b2g6 2 16 origin origin NN cord-305054-4d84b2g6 2 17 of of IN cord-305054-4d84b2g6 2 18 SARS SARS NNP cord-305054-4d84b2g6 2 19 - - HYPH cord-305054-4d84b2g6 2 20 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 2 21 date date NN cord-305054-4d84b2g6 2 22 : : : cord-305054-4d84b2g6 2 23 2020 2020 CD cord-305054-4d84b2g6 2 24 - - HYPH cord-305054-4d84b2g6 2 25 08 08 CD cord-305054-4d84b2g6 2 26 - - HYPH cord-305054-4d84b2g6 2 27 11 11 CD cord-305054-4d84b2g6 2 28 journal journal NN cord-305054-4d84b2g6 2 29 : : : cord-305054-4d84b2g6 3 1 bioRxiv biorxiv IN cord-305054-4d84b2g6 3 2 DOI DOI NNP cord-305054-4d84b2g6 3 3 : : : cord-305054-4d84b2g6 4 1 10.1101/2020.08.10.245290 10.1101/2020.08.10.245290 LS cord-305054-4d84b2g6 5 1 sha sha NNP cord-305054-4d84b2g6 5 2 : : : cord-305054-4d84b2g6 6 1 4321e6d28b6a5e0033fd6cfb4e1d1c092c501427 4321e6d28b6a5e0033fd6cfb4e1d1c092c501427 CD cord-305054-4d84b2g6 6 2 doc_id doc_id CD cord-305054-4d84b2g6 6 3 : : : 
cord-305054-4d84b2g6 6 4 305054 305054 CD cord-305054-4d84b2g6 6 5 cord_uid cord_uid NNS cord-305054-4d84b2g6 6 6 : : : cord-305054-4d84b2g6 6 7 4d84b2g6 4d84b2g6 CD cord-305054-4d84b2g6 7 1 The the DT cord-305054-4d84b2g6 7 2 pandemic pandemic NN cord-305054-4d84b2g6 7 3 caused cause VBN cord-305054-4d84b2g6 7 4 by by IN cord-305054-4d84b2g6 7 5 SARS SARS NNP cord-305054-4d84b2g6 7 6 - - HYPH cord-305054-4d84b2g6 7 7 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 7 8 has have VBZ cord-305054-4d84b2g6 7 9 a a DT cord-305054-4d84b2g6 7 10 great great JJ cord-305054-4d84b2g6 7 11 impact impact NN cord-305054-4d84b2g6 7 12 on on IN cord-305054-4d84b2g6 7 13 the the DT cord-305054-4d84b2g6 7 14 whole whole JJ cord-305054-4d84b2g6 7 15 world world NN cord-305054-4d84b2g6 7 16 . . . cord-305054-4d84b2g6 8 1 In in IN cord-305054-4d84b2g6 8 2 a a DT cord-305054-4d84b2g6 8 3 theory theory NN cord-305054-4d84b2g6 8 4 of of IN cord-305054-4d84b2g6 8 5 the the DT cord-305054-4d84b2g6 8 6 origin origin NN cord-305054-4d84b2g6 8 7 of of IN cord-305054-4d84b2g6 8 8 SARS SARS NNP cord-305054-4d84b2g6 8 9 - - HYPH cord-305054-4d84b2g6 8 10 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 8 11 , , , cord-305054-4d84b2g6 8 12 pangolins pangolin NNS cord-305054-4d84b2g6 8 13 were be VBD cord-305054-4d84b2g6 8 14 considered consider VBN cord-305054-4d84b2g6 8 15 a a DT cord-305054-4d84b2g6 8 16 potential potential JJ cord-305054-4d84b2g6 8 17 intermediate intermediate JJ cord-305054-4d84b2g6 8 18 host host NN cord-305054-4d84b2g6 8 19 . . . 
cord-305054-4d84b2g6 9 1 To to TO cord-305054-4d84b2g6 9 2 assemble assemble VB cord-305054-4d84b2g6 9 3 the the DT cord-305054-4d84b2g6 9 4 coronavirus coronavirus NN cord-305054-4d84b2g6 9 5 found find VBN cord-305054-4d84b2g6 9 6 in in IN cord-305054-4d84b2g6 9 7 pangolins pangolin NNS cord-305054-4d84b2g6 9 8 , , , cord-305054-4d84b2g6 9 9 SARS SARS NNP cord-305054-4d84b2g6 9 10 - - HYPH cord-305054-4d84b2g6 9 11 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 9 12 were be VBD cord-305054-4d84b2g6 9 13 used use VBN cord-305054-4d84b2g6 9 14 a a DT cord-305054-4d84b2g6 9 15 reference reference NN cord-305054-4d84b2g6 9 16 genome genome NN cord-305054-4d84b2g6 9 17 in in IN cord-305054-4d84b2g6 9 18 most most JJS cord-305054-4d84b2g6 9 19 of of IN cord-305054-4d84b2g6 9 20 studies study NNS cord-305054-4d84b2g6 9 21 , , , cord-305054-4d84b2g6 9 22 assuming assume VBG cord-305054-4d84b2g6 9 23 that that IN cord-305054-4d84b2g6 9 24 pangolins pangolin NNS cord-305054-4d84b2g6 9 25 CoV CoV NNP cord-305054-4d84b2g6 9 26 and and CC cord-305054-4d84b2g6 9 27 SARS SARS NNP cord-305054-4d84b2g6 9 28 - - HYPH cord-305054-4d84b2g6 9 29 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 9 30 are be VBP cord-305054-4d84b2g6 9 31 the the DT cord-305054-4d84b2g6 9 32 closest close JJS cord-305054-4d84b2g6 9 33 neighbors neighbor NNS cord-305054-4d84b2g6 9 34 in in IN cord-305054-4d84b2g6 9 35 the the DT cord-305054-4d84b2g6 9 36 evolution evolution NN cord-305054-4d84b2g6 9 37 . . . cord-305054-4d84b2g6 10 1 However however RB cord-305054-4d84b2g6 10 2 , , , cord-305054-4d84b2g6 10 3 this this DT cord-305054-4d84b2g6 10 4 assumption assumption NN cord-305054-4d84b2g6 10 5 may may MD cord-305054-4d84b2g6 10 6 not not RB cord-305054-4d84b2g6 10 7 be be VB cord-305054-4d84b2g6 10 8 true true JJ cord-305054-4d84b2g6 10 9 . . . 
cord-305054-4d84b2g6 11 1 We -PRON- PRP cord-305054-4d84b2g6 11 2 investigated investigate VBD cord-305054-4d84b2g6 11 3 how how WRB cord-305054-4d84b2g6 11 4 the the DT cord-305054-4d84b2g6 11 5 selection selection NN cord-305054-4d84b2g6 11 6 of of IN cord-305054-4d84b2g6 11 7 reference reference NN cord-305054-4d84b2g6 11 8 genome genome NN cord-305054-4d84b2g6 11 9 affect affect VBP cord-305054-4d84b2g6 11 10 the the DT cord-305054-4d84b2g6 11 11 resulting result VBG cord-305054-4d84b2g6 11 12 CoV CoV NNP cord-305054-4d84b2g6 11 13 genome genome NNP cord-305054-4d84b2g6 11 14 assembly assembly NNP cord-305054-4d84b2g6 11 15 . . . cord-305054-4d84b2g6 12 1 We -PRON- PRP cord-305054-4d84b2g6 12 2 explored explore VBD cord-305054-4d84b2g6 12 3 various various JJ cord-305054-4d84b2g6 12 4 representative representative JJ cord-305054-4d84b2g6 12 5 CoV cov NN cord-305054-4d84b2g6 12 6 as as IN cord-305054-4d84b2g6 12 7 reference reference NN cord-305054-4d84b2g6 12 8 genome genome NN cord-305054-4d84b2g6 12 9 , , , cord-305054-4d84b2g6 12 10 and and CC cord-305054-4d84b2g6 12 11 found find VBD cord-305054-4d84b2g6 12 12 significant significant JJ cord-305054-4d84b2g6 12 13 differences difference NNS cord-305054-4d84b2g6 12 14 in in IN cord-305054-4d84b2g6 12 15 the the DT cord-305054-4d84b2g6 12 16 resulting result VBG cord-305054-4d84b2g6 12 17 assemblies assembly NNS cord-305054-4d84b2g6 12 18 . . . 
cord-305054-4d84b2g6 13 1 The the DT cord-305054-4d84b2g6 13 2 assembly assembly NN cord-305054-4d84b2g6 13 3 obtained obtain VBD cord-305054-4d84b2g6 13 4 using use VBG cord-305054-4d84b2g6 13 5 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 13 6 as as IN cord-305054-4d84b2g6 13 7 reference reference NN cord-305054-4d84b2g6 13 8 showed show VBD cord-305054-4d84b2g6 13 9 better well JJR cord-305054-4d84b2g6 13 10 statistics statistic NNS cord-305054-4d84b2g6 13 11 in in IN cord-305054-4d84b2g6 13 12 total total JJ cord-305054-4d84b2g6 13 13 length length NN cord-305054-4d84b2g6 13 14 and and CC cord-305054-4d84b2g6 13 15 N50 N50 NNP cord-305054-4d84b2g6 13 16 than than IN cord-305054-4d84b2g6 13 17 the the DT cord-305054-4d84b2g6 13 18 assembly assembly NN cord-305054-4d84b2g6 13 19 guided guide VBN cord-305054-4d84b2g6 13 20 by by IN cord-305054-4d84b2g6 13 21 SARS SARS NNP cord-305054-4d84b2g6 13 22 - - HYPH cord-305054-4d84b2g6 13 23 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 13 24 , , , cord-305054-4d84b2g6 13 25 indicating indicate VBG cord-305054-4d84b2g6 13 26 that that IN cord-305054-4d84b2g6 13 27 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 14 1 maybe maybe RB cord-305054-4d84b2g6 14 2 a a DT cord-305054-4d84b2g6 14 3 better well JJR cord-305054-4d84b2g6 14 4 reference reference NN cord-305054-4d84b2g6 14 5 for for IN cord-305054-4d84b2g6 14 6 assembling assemble VBG cord-305054-4d84b2g6 14 7 CoV cov NN cord-305054-4d84b2g6 14 8 in in IN cord-305054-4d84b2g6 14 9 pangolin pangolin NN cord-305054-4d84b2g6 14 10 or or CC cord-305054-4d84b2g6 14 11 other other JJ cord-305054-4d84b2g6 14 12 potential potential JJ cord-305054-4d84b2g6 14 13 intermediate intermediate JJ cord-305054-4d84b2g6 14 14 hosts host NNS cord-305054-4d84b2g6 14 15 . . . 
cord-305054-4d84b2g6 15 1 Recently recently RB cord-305054-4d84b2g6 15 2 , , , cord-305054-4d84b2g6 15 3 the the DT cord-305054-4d84b2g6 15 4 outbreak outbreak NN cord-305054-4d84b2g6 15 5 of of IN cord-305054-4d84b2g6 15 6 SARS SARS NNP cord-305054-4d84b2g6 15 7 - - HYPH cord-305054-4d84b2g6 15 8 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 15 9 ( ( -LRB- cord-305054-4d84b2g6 15 10 COVID-19 COVID-19 NNP cord-305054-4d84b2g6 15 11 ) ) -RRB- cord-305054-4d84b2g6 15 12 has have VBZ cord-305054-4d84b2g6 15 13 caused cause VBN cord-305054-4d84b2g6 15 14 an an DT cord-305054-4d84b2g6 15 15 ongoing ongoing JJ cord-305054-4d84b2g6 15 16 global global JJ cord-305054-4d84b2g6 15 17 pandemic pandemic NN cord-305054-4d84b2g6 15 18 . . . cord-305054-4d84b2g6 16 1 As as IN cord-305054-4d84b2g6 16 2 of of IN cord-305054-4d84b2g6 16 3 July July NNP cord-305054-4d84b2g6 16 4 21 21 CD cord-305054-4d84b2g6 16 5 , , , cord-305054-4d84b2g6 16 6 2020 2020 CD cord-305054-4d84b2g6 16 7 , , , cord-305054-4d84b2g6 16 8 the the DT cord-305054-4d84b2g6 16 9 pandemic pandemic NN cord-305054-4d84b2g6 16 10 resulting result VBG cord-305054-4d84b2g6 16 11 in in IN cord-305054-4d84b2g6 16 12 a a DT cord-305054-4d84b2g6 16 13 total total NN cord-305054-4d84b2g6 16 14 of of IN cord-305054-4d84b2g6 16 15 14,562,550 14,562,550 CD cord-305054-4d84b2g6 16 16 clinical clinical JJ cord-305054-4d84b2g6 16 17 cases case NNS cord-305054-4d84b2g6 16 18 and and CC cord-305054-4d84b2g6 16 19 607,781 607,781 CD cord-305054-4d84b2g6 16 20 deaths death NNS cord-305054-4d84b2g6 16 21 all all RB cord-305054-4d84b2g6 16 22 over over IN cord-305054-4d84b2g6 16 23 the the DT cord-305054-4d84b2g6 16 24 world world NN cord-305054-4d84b2g6 16 25 ( ( -LRB- cord-305054-4d84b2g6 16 26 www.who.int www.who.int NNP cord-305054-4d84b2g6 16 27 ) ) -RRB- cord-305054-4d84b2g6 16 28 . . . 
cord-305054-4d84b2g6 17 1 With with IN cord-305054-4d84b2g6 17 2 an an DT cord-305054-4d84b2g6 17 3 effort effort NN cord-305054-4d84b2g6 17 4 of of IN cord-305054-4d84b2g6 17 5 metagenomic metagenomic NNP cord-305054-4d84b2g6 17 6 RNA RNA NNP cord-305054-4d84b2g6 17 7 deep deep JJ cord-305054-4d84b2g6 17 8 sequencing sequencing NN cord-305054-4d84b2g6 17 9 , , , cord-305054-4d84b2g6 17 10 the the DT cord-305054-4d84b2g6 17 11 genome genome NN cord-305054-4d84b2g6 17 12 of of IN cord-305054-4d84b2g6 17 13 a a DT cord-305054-4d84b2g6 17 14 SARS SARS NNP cord-305054-4d84b2g6 17 15 - - HYPH cord-305054-4d84b2g6 17 16 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 17 17 isolate isolate VB cord-305054-4d84b2g6 17 18 , , , cord-305054-4d84b2g6 17 19 Wuhan Wuhan NNP cord-305054-4d84b2g6 17 20 - - HYPH cord-305054-4d84b2g6 17 21 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 17 22 , , , cord-305054-4d84b2g6 17 23 was be VBD cord-305054-4d84b2g6 17 24 published publish VBN cord-305054-4d84b2g6 17 25 in in IN cord-305054-4d84b2g6 17 26 [ [ -LRB- cord-305054-4d84b2g6 17 27 11 11 CD cord-305054-4d84b2g6 17 28 ] ] -RRB- cord-305054-4d84b2g6 17 29 . . . 
cord-305054-4d84b2g6 18 1 The the DT cord-305054-4d84b2g6 18 2 genome genome NN cord-305054-4d84b2g6 18 3 showed show VBD cord-305054-4d84b2g6 18 4 high high JJ cord-305054-4d84b2g6 18 5 nucleotide nucleotide JJ cord-305054-4d84b2g6 18 6 similarity similarity NN cord-305054-4d84b2g6 18 7 ( ( -LRB- cord-305054-4d84b2g6 18 8 89.1 89.1 CD cord-305054-4d84b2g6 18 9 % % NN cord-305054-4d84b2g6 18 10 ) ) -RRB- cord-305054-4d84b2g6 18 11 to to IN cord-305054-4d84b2g6 18 12 a a DT cord-305054-4d84b2g6 18 13 group group NN cord-305054-4d84b2g6 18 14 of of IN cord-305054-4d84b2g6 18 15 SARS SARS NNP cord-305054-4d84b2g6 18 16 - - HYPH cord-305054-4d84b2g6 18 17 like like JJ cord-305054-4d84b2g6 18 18 coronavirus coronavirus NN cord-305054-4d84b2g6 18 19 that that WDT cord-305054-4d84b2g6 18 20 were be VBD cord-305054-4d84b2g6 18 21 identified identify VBN cord-305054-4d84b2g6 18 22 in in IN cord-305054-4d84b2g6 18 23 bats bat NNS cord-305054-4d84b2g6 18 24 in in IN cord-305054-4d84b2g6 18 25 China China NNP cord-305054-4d84b2g6 18 26 , , , cord-305054-4d84b2g6 18 27 which which WDT cord-305054-4d84b2g6 18 28 indicated indicate VBD cord-305054-4d84b2g6 18 29 the the DT cord-305054-4d84b2g6 18 30 possibility possibility NN cord-305054-4d84b2g6 18 31 of of IN cord-305054-4d84b2g6 18 32 animal animal NN cord-305054-4d84b2g6 18 33 origin origin NN cord-305054-4d84b2g6 18 34 . . . 
cord-305054-4d84b2g6 19 1 To to TO cord-305054-4d84b2g6 19 2 identify identify VB cord-305054-4d84b2g6 19 3 potential potential JJ cord-305054-4d84b2g6 19 4 direct direct JJ cord-305054-4d84b2g6 19 5 or or CC cord-305054-4d84b2g6 19 6 intermediate intermediate JJ cord-305054-4d84b2g6 19 7 host host NN cord-305054-4d84b2g6 19 8 of of IN cord-305054-4d84b2g6 19 9 SARS SARS NNP cord-305054-4d84b2g6 19 10 - - HYPH cord-305054-4d84b2g6 19 11 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 19 12 , , , cord-305054-4d84b2g6 19 13 coronavirus coronavirus NN cord-305054-4d84b2g6 19 14 in in IN cord-305054-4d84b2g6 19 15 several several JJ cord-305054-4d84b2g6 19 16 animals animal NNS cord-305054-4d84b2g6 19 17 were be VBD cord-305054-4d84b2g6 19 18 studied study VBN cord-305054-4d84b2g6 19 19 and and CC cord-305054-4d84b2g6 19 20 compared compare VBN cord-305054-4d84b2g6 19 21 with with IN cord-305054-4d84b2g6 19 22 SARS SARS NNP cord-305054-4d84b2g6 19 23 - - HYPH cord-305054-4d84b2g6 19 24 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 19 25 . . . cord-305054-4d84b2g6 20 1 Bat bat NN cord-305054-4d84b2g6 20 2 and and CC cord-305054-4d84b2g6 20 3 pangolin pangolin NN cord-305054-4d84b2g6 20 4 were be VBD cord-305054-4d84b2g6 20 5 the the DT cord-305054-4d84b2g6 20 6 two two CD cord-305054-4d84b2g6 20 7 mostly mostly RB cord-305054-4d84b2g6 20 8 investigated investigate VBN cord-305054-4d84b2g6 20 9 species specie NNS cord-305054-4d84b2g6 20 10 . . . 
cord-305054-4d84b2g6 21 1 A a DT cord-305054-4d84b2g6 21 2 CoV cov NN cord-305054-4d84b2g6 21 3 in in IN cord-305054-4d84b2g6 21 4 bat bat NN cord-305054-4d84b2g6 21 5 , , , cord-305054-4d84b2g6 21 6 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 21 7 , , , cord-305054-4d84b2g6 21 8 showed show VBD cord-305054-4d84b2g6 21 9 96 96 CD cord-305054-4d84b2g6 21 10 % % NN cord-305054-4d84b2g6 21 11 full full JJ cord-305054-4d84b2g6 21 12 - - HYPH cord-305054-4d84b2g6 21 13 genome genome NN cord-305054-4d84b2g6 21 14 similarity similarity NN cord-305054-4d84b2g6 21 15 with with IN cord-305054-4d84b2g6 21 16 SARS SARS NNP cord-305054-4d84b2g6 21 17 - - HYPH cord-305054-4d84b2g6 21 18 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 21 19 [ [ -LRB- cord-305054-4d84b2g6 21 20 14 14 CD cord-305054-4d84b2g6 21 21 ] ] -RRB- cord-305054-4d84b2g6 21 22 . . . cord-305054-4d84b2g6 22 1 Other other JJ cord-305054-4d84b2g6 22 2 virus virus NN cord-305054-4d84b2g6 22 3 isolates isolate VBZ cord-305054-4d84b2g6 22 4 from from IN cord-305054-4d84b2g6 22 5 bat bat NN cord-305054-4d84b2g6 22 6 , , , cord-305054-4d84b2g6 22 7 ZXC21 ZXC21 NNP cord-305054-4d84b2g6 22 8 and and CC cord-305054-4d84b2g6 22 9 ZC45 ZC45 NNP cord-305054-4d84b2g6 22 10 also also RB cord-305054-4d84b2g6 22 11 shared share VBD cord-305054-4d84b2g6 22 12 85 85 CD cord-305054-4d84b2g6 22 13 % % NN cord-305054-4d84b2g6 22 14 of of IN cord-305054-4d84b2g6 22 15 similarity similarity NN cord-305054-4d84b2g6 22 16 to to IN cord-305054-4d84b2g6 22 17 SARS SARS NNP cord-305054-4d84b2g6 22 18 - - HYPH cord-305054-4d84b2g6 22 19 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 22 20 . . . 
cord-305054-4d84b2g6 23 1 These these DT cord-305054-4d84b2g6 23 2 results result NNS cord-305054-4d84b2g6 23 3 led lead VBD cord-305054-4d84b2g6 23 4 to to IN cord-305054-4d84b2g6 23 5 the the DT cord-305054-4d84b2g6 23 6 hypothesis hypothesis NN cord-305054-4d84b2g6 23 7 that that IN cord-305054-4d84b2g6 23 8 the the DT cord-305054-4d84b2g6 23 9 progenitor progenitor NN cord-305054-4d84b2g6 23 10 of of IN cord-305054-4d84b2g6 23 11 SARS SARS NNP cord-305054-4d84b2g6 23 12 - - HYPH cord-305054-4d84b2g6 23 13 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 23 14 originated originate VBD cord-305054-4d84b2g6 23 15 in in IN cord-305054-4d84b2g6 23 16 Bats bat NNS cord-305054-4d84b2g6 23 17 and and CC cord-305054-4d84b2g6 23 18 it -PRON- PRP cord-305054-4d84b2g6 23 19 spilled spill VBD cord-305054-4d84b2g6 23 20 over over RP cord-305054-4d84b2g6 23 21 to to IN cord-305054-4d84b2g6 23 22 humans human NNS cord-305054-4d84b2g6 23 23 using use VBG cord-305054-4d84b2g6 23 24 another another DT cord-305054-4d84b2g6 23 25 animal animal NN cord-305054-4d84b2g6 23 26 as as IN cord-305054-4d84b2g6 23 27 intermediate intermediate JJ cord-305054-4d84b2g6 23 28 host host NN cord-305054-4d84b2g6 23 29 . . . 
cord-305054-4d84b2g6 24 1 Many many JJ cord-305054-4d84b2g6 24 2 researchers researcher NNS cord-305054-4d84b2g6 24 3 believed believe VBD cord-305054-4d84b2g6 24 4 that that IN cord-305054-4d84b2g6 24 5 pangolins pangolin NNS cord-305054-4d84b2g6 24 6 are be VBP cord-305054-4d84b2g6 24 7 a a DT cord-305054-4d84b2g6 24 8 potential potential JJ cord-305054-4d84b2g6 24 9 intermediate intermediate JJ cord-305054-4d84b2g6 24 10 host host NN cord-305054-4d84b2g6 24 11 and and CC cord-305054-4d84b2g6 24 12 they -PRON- PRP cord-305054-4d84b2g6 24 13 attempted attempt VBD cord-305054-4d84b2g6 24 14 to to TO cord-305054-4d84b2g6 24 15 characterize characterize VB cord-305054-4d84b2g6 24 16 coronavirus coronavirus NN cord-305054-4d84b2g6 24 17 in in IN cord-305054-4d84b2g6 24 18 pangolin pangolin NN cord-305054-4d84b2g6 24 19 . . . cord-305054-4d84b2g6 25 1 Liu Liu NNP cord-305054-4d84b2g6 25 2 , , , cord-305054-4d84b2g6 25 3 Chen Chen NNP cord-305054-4d84b2g6 25 4 , , , cord-305054-4d84b2g6 25 5 and and CC cord-305054-4d84b2g6 25 6 Chen Chen NNP cord-305054-4d84b2g6 25 7 [ [ -LRB- cord-305054-4d84b2g6 25 8 8 8 CD cord-305054-4d84b2g6 25 9 ] ] -RRB- cord-305054-4d84b2g6 25 10 constructed construct VBN cord-305054-4d84b2g6 25 11 coronavirus coronavirus NN cord-305054-4d84b2g6 25 12 contigs contigs NN cord-305054-4d84b2g6 25 13 using use VBG cord-305054-4d84b2g6 25 14 de de NNP cord-305054-4d84b2g6 25 15 novo novo NNP cord-305054-4d84b2g6 25 16 assembly assembly NNP cord-305054-4d84b2g6 25 17 method method NN cord-305054-4d84b2g6 25 18 from from IN cord-305054-4d84b2g6 25 19 organ organ NN cord-305054-4d84b2g6 25 20 samples sample NNS cord-305054-4d84b2g6 25 21 of of IN cord-305054-4d84b2g6 25 22 dead dead JJ cord-305054-4d84b2g6 25 23 Malayan malayan JJ cord-305054-4d84b2g6 25 24 pangolins pangolin NNS cord-305054-4d84b2g6 25 25 rescued rescue VBN cord-305054-4d84b2g6 25 26 at at IN cord-305054-4d84b2g6 25 27 the the DT cord-305054-4d84b2g6 25 28 Guangdong Guangdong NNP 
cord-305054-4d84b2g6 25 29 Wildlife Wildlife NNP cord-305054-4d84b2g6 25 30 Rescue Rescue NNP cord-305054-4d84b2g6 25 31 Center Center NNP cord-305054-4d84b2g6 25 32 . . . cord-305054-4d84b2g6 26 1 Of of IN cord-305054-4d84b2g6 26 2 the the DT cord-305054-4d84b2g6 26 3 11 11 CD cord-305054-4d84b2g6 26 4 collected collect VBN cord-305054-4d84b2g6 26 5 pangolins pangolin NNS cord-305054-4d84b2g6 26 6 , , , cord-305054-4d84b2g6 26 7 coronavirus coronavirus NN cord-305054-4d84b2g6 26 8 was be VBD cord-305054-4d84b2g6 26 9 detected detect VBN cord-305054-4d84b2g6 26 10 in in IN cord-305054-4d84b2g6 26 11 two two CD cord-305054-4d84b2g6 26 12 . . . cord-305054-4d84b2g6 27 1 Zhang Zhang NNP cord-305054-4d84b2g6 27 2 , , , cord-305054-4d84b2g6 27 3 Wu Wu NNP cord-305054-4d84b2g6 27 4 , , , cord-305054-4d84b2g6 27 5 and and CC cord-305054-4d84b2g6 27 6 Zhang Zhang NNP cord-305054-4d84b2g6 27 7 [ [ -LRB- cord-305054-4d84b2g6 27 8 13 13 CD cord-305054-4d84b2g6 27 9 ] ] -RRB- cord-305054-4d84b2g6 27 10 re re VBD cord-305054-4d84b2g6 27 11 - - VBN cord-305054-4d84b2g6 27 12 analyzed analyze VBD cord-305054-4d84b2g6 27 13 the the DT cord-305054-4d84b2g6 27 14 RNA RNA NNP cord-305054-4d84b2g6 27 15 - - HYPH cord-305054-4d84b2g6 27 16 Seq Seq NNP cord-305054-4d84b2g6 27 17 reads read VBZ cord-305054-4d84b2g6 27 18 from from IN cord-305054-4d84b2g6 27 19 two two CD cord-305054-4d84b2g6 27 20 pangolins pangolin NNS cord-305054-4d84b2g6 27 21 carrying carry VBG cord-305054-4d84b2g6 27 22 coronavirus coronavirus NN cord-305054-4d84b2g6 27 23 using use VBG cord-305054-4d84b2g6 27 24 reference reference NN cord-305054-4d84b2g6 27 25 - - HYPH cord-305054-4d84b2g6 27 26 guided guide VBN cord-305054-4d84b2g6 27 27 de de FW cord-305054-4d84b2g6 27 28 novo novo NNP cord-305054-4d84b2g6 27 29 assembly assembly NNP cord-305054-4d84b2g6 27 30 method method NN cord-305054-4d84b2g6 27 31 , , , cord-305054-4d84b2g6 27 32 with with IN cord-305054-4d84b2g6 27 33 Wuhan Wuhan NNP 
cord-305054-4d84b2g6 27 34 - - HYPH cord-305054-4d84b2g6 27 35 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 27 36 as as IN cord-305054-4d84b2g6 27 37 the the DT cord-305054-4d84b2g6 27 38 reference reference NN cord-305054-4d84b2g6 27 39 genome genome NN cord-305054-4d84b2g6 27 40 . . . cord-305054-4d84b2g6 28 1 The the DT cord-305054-4d84b2g6 28 2 resulting result VBG cord-305054-4d84b2g6 28 3 draft draft NN cord-305054-4d84b2g6 28 4 genome genome NN cord-305054-4d84b2g6 28 5 shared share VBD cord-305054-4d84b2g6 28 6 91.02 91.02 CD cord-305054-4d84b2g6 28 7 and and CC cord-305054-4d84b2g6 28 8 90.55 90.55 CD cord-305054-4d84b2g6 28 9 % % NN cord-305054-4d84b2g6 28 10 whole whole JJ cord-305054-4d84b2g6 28 11 genome genome NN cord-305054-4d84b2g6 28 12 similarity similarity NN cord-305054-4d84b2g6 28 13 with with IN cord-305054-4d84b2g6 28 14 Wuhan Wuhan NNP cord-305054-4d84b2g6 28 15 - - HYPH cord-305054-4d84b2g6 28 16 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 29 1 and and CC cord-305054-4d84b2g6 29 2 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 29 3 , , , cord-305054-4d84b2g6 29 4 respectively respectively RB cord-305054-4d84b2g6 29 5 . . . cord-305054-4d84b2g6 30 1 Xiao Xiao NNP cord-305054-4d84b2g6 30 2 et et NNP cord-305054-4d84b2g6 30 3 al al NNP cord-305054-4d84b2g6 30 4 . . . 
cord-305054-4d84b2g6 31 1 [ [ -LRB- cord-305054-4d84b2g6 31 2 12 12 CD cord-305054-4d84b2g6 31 3 ] ] -RRB- cord-305054-4d84b2g6 31 4 obtained obtain VBN cord-305054-4d84b2g6 31 5 21 21 CD cord-305054-4d84b2g6 31 6 samples sample NNS cord-305054-4d84b2g6 31 7 from from IN cord-305054-4d84b2g6 31 8 pangolins pangolin NNS cord-305054-4d84b2g6 31 9 rescued rescue VBN cord-305054-4d84b2g6 31 10 by by IN cord-305054-4d84b2g6 31 11 Guangdong Guangdong NNP cord-305054-4d84b2g6 31 12 Customs Customs NNP cord-305054-4d84b2g6 31 13 and and CC cord-305054-4d84b2g6 31 14 conducted conduct VBN cord-305054-4d84b2g6 31 15 reference reference NN cord-305054-4d84b2g6 31 16 - - HYPH cord-305054-4d84b2g6 31 17 guided guide VBN cord-305054-4d84b2g6 31 18 genome genome NN cord-305054-4d84b2g6 31 19 assembly assembly NN cord-305054-4d84b2g6 31 20 , , , cord-305054-4d84b2g6 31 21 with with IN cord-305054-4d84b2g6 31 22 Wuhan Wuhan NNP cord-305054-4d84b2g6 31 23 - - HYPH cord-305054-4d84b2g6 31 24 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 31 25 as as IN cord-305054-4d84b2g6 31 26 the the DT cord-305054-4d84b2g6 31 27 reference reference NN cord-305054-4d84b2g6 31 28 genome genome NN cord-305054-4d84b2g6 31 29 . . . 
cord-305054-4d84b2g6 32 1 The the DT cord-305054-4d84b2g6 32 2 derived derive VBN cord-305054-4d84b2g6 32 3 viral viral JJ cord-305054-4d84b2g6 32 4 genome genome NN cord-305054-4d84b2g6 32 5 showed show VBD cord-305054-4d84b2g6 32 6 80 80 CD cord-305054-4d84b2g6 32 7 and and CC cord-305054-4d84b2g6 32 8 98 98 CD cord-305054-4d84b2g6 32 9 % % NN cord-305054-4d84b2g6 32 10 whole whole JJ cord-305054-4d84b2g6 32 11 genome genome NN cord-305054-4d84b2g6 32 12 sequence sequence NN cord-305054-4d84b2g6 32 13 identity identity NN cord-305054-4d84b2g6 32 14 to to IN cord-305054-4d84b2g6 32 15 SARS SARS NNP cord-305054-4d84b2g6 32 16 - - HYPH cord-305054-4d84b2g6 32 17 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 32 18 and and CC cord-305054-4d84b2g6 32 19 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 32 20 , , , cord-305054-4d84b2g6 32 21 respectively respectively RB cord-305054-4d84b2g6 32 22 . . . cord-305054-4d84b2g6 33 1 Lam Lam NNP cord-305054-4d84b2g6 33 2 et et NNP cord-305054-4d84b2g6 33 3 al al NNP cord-305054-4d84b2g6 33 4 . . . cord-305054-4d84b2g6 34 1 [ [ -LRB- cord-305054-4d84b2g6 34 2 3 3 CD cord-305054-4d84b2g6 34 3 ] ] -RRB- cord-305054-4d84b2g6 34 4 collected collect VBD cord-305054-4d84b2g6 34 5 43 43 CD cord-305054-4d84b2g6 34 6 samples sample NNS cord-305054-4d84b2g6 34 7 from from IN cord-305054-4d84b2g6 34 8 18 18 CD cord-305054-4d84b2g6 34 9 pangolins pangolin NNS cord-305054-4d84b2g6 34 10 from from IN cord-305054-4d84b2g6 34 11 Guangxi Guangxi NNP cord-305054-4d84b2g6 34 12 Medical Medical NNP cord-305054-4d84b2g6 34 13 University University NNP cord-305054-4d84b2g6 34 14 , , , cord-305054-4d84b2g6 34 15 China China NNP cord-305054-4d84b2g6 34 16 , , , cord-305054-4d84b2g6 34 17 and and CC cord-305054-4d84b2g6 34 18 six six CD cord-305054-4d84b2g6 34 19 samples sample NNS cord-305054-4d84b2g6 34 20 contained contain VBD cord-305054-4d84b2g6 34 21 coronavirus coronavirus NN cord-305054-4d84b2g6 34 22 sequences sequence NNS cord-305054-4d84b2g6 34 23 . . . 
cord-305054-4d84b2g6 35 1 The the DT cord-305054-4d84b2g6 35 2 viral viral JJ cord-305054-4d84b2g6 35 3 genomes genome NNS cord-305054-4d84b2g6 35 4 of of IN cord-305054-4d84b2g6 35 5 six six CD cord-305054-4d84b2g6 35 6 samples sample NNS cord-305054-4d84b2g6 35 7 were be VBD cord-305054-4d84b2g6 35 8 de de IN cord-305054-4d84b2g6 35 9 novo novo NNP cord-305054-4d84b2g6 35 10 assembled assemble VBD cord-305054-4d84b2g6 35 11 . . . cord-305054-4d84b2g6 36 1 They -PRON- PRP cord-305054-4d84b2g6 36 2 also also RB cord-305054-4d84b2g6 36 3 performed perform VBD cord-305054-4d84b2g6 36 4 RNA RNA NNP cord-305054-4d84b2g6 36 5 sequencing sequencing NN cord-305054-4d84b2g6 36 6 in in IN cord-305054-4d84b2g6 36 7 five five CD cord-305054-4d84b2g6 36 8 archived archived JJ cord-305054-4d84b2g6 36 9 pangolins pangolin NNS cord-305054-4d84b2g6 36 10 samples sample NNS cord-305054-4d84b2g6 36 11 from from IN cord-305054-4d84b2g6 36 12 Guangdong Guangdong NNP cord-305054-4d84b2g6 36 13 , , , cord-305054-4d84b2g6 36 14 and and CC cord-305054-4d84b2g6 36 15 assembled assemble VBD cord-305054-4d84b2g6 36 16 the the DT cord-305054-4d84b2g6 36 17 genomes genome NNS cord-305054-4d84b2g6 36 18 using use VBG cord-305054-4d84b2g6 36 19 WIV04 WIV04 NNP cord-305054-4d84b2g6 36 20 , , , cord-305054-4d84b2g6 36 21 another another DT cord-305054-4d84b2g6 36 22 SARS SARS NNP cord-305054-4d84b2g6 36 23 - - HYPH cord-305054-4d84b2g6 36 24 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 36 25 genome genome NN cord-305054-4d84b2g6 36 26 from from IN cord-305054-4d84b2g6 36 27 human human JJ cord-305054-4d84b2g6 36 28 , , , cord-305054-4d84b2g6 36 29 as as IN cord-305054-4d84b2g6 36 30 reference reference NN cord-305054-4d84b2g6 36 31 genome genome NN cord-305054-4d84b2g6 36 32 . . . 
cord-305054-4d84b2g6 37 1 The the DT cord-305054-4d84b2g6 37 2 resultant resultant JJ cord-305054-4d84b2g6 37 3 draft draft NN cord-305054-4d84b2g6 37 4 genomes genome NNS cord-305054-4d84b2g6 37 5 have have VBP cord-305054-4d84b2g6 37 6 85.5 85.5 CD cord-305054-4d84b2g6 37 7 % % NN cord-305054-4d84b2g6 37 8 to to IN cord-305054-4d84b2g6 37 9 92.4 92.4 CD cord-305054-4d84b2g6 37 10 % % NN cord-305054-4d84b2g6 37 11 identity identity NN cord-305054-4d84b2g6 37 12 to to IN cord-305054-4d84b2g6 37 13 the the DT cord-305054-4d84b2g6 37 14 SARS SARS NNP cord-305054-4d84b2g6 37 15 - - HYPH cord-305054-4d84b2g6 37 16 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 37 17 genome genome NN cord-305054-4d84b2g6 37 18 . . . cord-305054-4d84b2g6 38 1 They -PRON- PRP cord-305054-4d84b2g6 38 2 suggested suggest VBD cord-305054-4d84b2g6 38 3 two two CD cord-305054-4d84b2g6 38 4 sub sub NN cord-305054-4d84b2g6 38 5 - - NNS cord-305054-4d84b2g6 38 6 lineages lineage NNS cord-305054-4d84b2g6 38 7 of of IN cord-305054-4d84b2g6 38 8 coronavirus coronavirus NN cord-305054-4d84b2g6 38 9 existed exist VBD cord-305054-4d84b2g6 38 10 in in IN cord-305054-4d84b2g6 38 11 pangolins pangolin NNS cord-305054-4d84b2g6 38 12 . . . 
cord-305054-4d84b2g6 39 1 In in IN cord-305054-4d84b2g6 39 2 terms term NNS cord-305054-4d84b2g6 39 3 of of IN cord-305054-4d84b2g6 39 4 all all DT cord-305054-4d84b2g6 39 5 coding code VBG cord-305054-4d84b2g6 39 6 sites site NNS cord-305054-4d84b2g6 39 7 , , , cord-305054-4d84b2g6 39 8 coronavirus coronavirus NN cord-305054-4d84b2g6 39 9 identified identify VBN cord-305054-4d84b2g6 39 10 in in IN cord-305054-4d84b2g6 39 11 pangolins pangolin NNS cord-305054-4d84b2g6 39 12 from from IN cord-305054-4d84b2g6 39 13 Guangdong Guangdong NNP cord-305054-4d84b2g6 39 14 is be VBZ cord-305054-4d84b2g6 39 15 more more RBR cord-305054-4d84b2g6 39 16 closely closely RB cord-305054-4d84b2g6 39 17 related related JJ cord-305054-4d84b2g6 39 18 to to IN cord-305054-4d84b2g6 39 19 WIV04 WIV04 NNP cord-305054-4d84b2g6 39 20 than than IN cord-305054-4d84b2g6 39 21 Bat Bat NNP cord-305054-4d84b2g6 39 22 - - HYPH cord-305054-4d84b2g6 39 23 CoV cov NN cord-305054-4d84b2g6 39 24 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 39 25 , , , cord-305054-4d84b2g6 39 26 whereas whereas IN cord-305054-4d84b2g6 39 27 coronavirus coronavirus NN cord-305054-4d84b2g6 39 28 genome genome NN cord-305054-4d84b2g6 39 29 of of IN cord-305054-4d84b2g6 39 30 pangolins pangolin NNS cord-305054-4d84b2g6 39 31 in in IN cord-305054-4d84b2g6 39 32 Guangxi Guangxi NNP cord-305054-4d84b2g6 39 33 showed show VBD cord-305054-4d84b2g6 39 34 lower low JJR cord-305054-4d84b2g6 39 35 genome genome NN cord-305054-4d84b2g6 39 36 similarity similarity NN cord-305054-4d84b2g6 39 37 to to IN cord-305054-4d84b2g6 39 38 WIV04 WIV04 NNP cord-305054-4d84b2g6 39 39 than than IN cord-305054-4d84b2g6 39 40 Bat Bat NNP cord-305054-4d84b2g6 39 41 - - HYPH cord-305054-4d84b2g6 39 42 CoV cov NN cord-305054-4d84b2g6 39 43 RaTG13 ratg13 NN cord-305054-4d84b2g6 39 44 . . . cord-305054-4d84b2g6 40 1 Liu Liu NNP cord-305054-4d84b2g6 40 2 et et NNP cord-305054-4d84b2g6 40 3 al al NNP cord-305054-4d84b2g6 40 4 . . . 
cord-305054-4d84b2g6 41 1 [ [ -LRB- cord-305054-4d84b2g6 41 2 8 8 CD cord-305054-4d84b2g6 41 3 ] ] -RRB- cord-305054-4d84b2g6 41 4 took take VBD cord-305054-4d84b2g6 41 5 samples sample NNS cord-305054-4d84b2g6 41 6 from from IN cord-305054-4d84b2g6 41 7 three three CD cord-305054-4d84b2g6 41 8 coronavirus coronavirus NN cord-305054-4d84b2g6 41 9 positive positive JJ cord-305054-4d84b2g6 41 10 pangolins pangolin NNS cord-305054-4d84b2g6 41 11 rescued rescue VBN cord-305054-4d84b2g6 41 12 in in IN cord-305054-4d84b2g6 41 13 Guangdong Guangdong NNP cord-305054-4d84b2g6 41 14 and and CC cord-305054-4d84b2g6 41 15 performed perform VBN cord-305054-4d84b2g6 41 16 deep deep JJ cord-305054-4d84b2g6 41 17 sequencing sequencing NN cord-305054-4d84b2g6 41 18 . . . cord-305054-4d84b2g6 42 1 Using use VBG cord-305054-4d84b2g6 42 2 de de FW cord-305054-4d84b2g6 42 3 novo novo NNP cord-305054-4d84b2g6 42 4 assembly assembly NNP cord-305054-4d84b2g6 42 5 method method NN cord-305054-4d84b2g6 42 6 , , , cord-305054-4d84b2g6 42 7 they -PRON- PRP cord-305054-4d84b2g6 42 8 obtained obtain VBD cord-305054-4d84b2g6 42 9 viral viral JJ cord-305054-4d84b2g6 42 10 genome genome NN cord-305054-4d84b2g6 42 11 that that WDT cord-305054-4d84b2g6 42 12 showed show VBD cord-305054-4d84b2g6 42 13 90.32 90.32 CD cord-305054-4d84b2g6 42 14 % % NN cord-305054-4d84b2g6 42 15 and and CC cord-305054-4d84b2g6 42 16 90.24 90.24 CD cord-305054-4d84b2g6 42 17 % % NN cord-305054-4d84b2g6 42 18 of of IN cord-305054-4d84b2g6 42 19 whole whole JJ cord-305054-4d84b2g6 42 20 genome genome NN cord-305054-4d84b2g6 42 21 identify identify NN cord-305054-4d84b2g6 42 22 to to IN cord-305054-4d84b2g6 42 23 Wuhan Wuhan NNP cord-305054-4d84b2g6 42 24 - - HYPH cord-305054-4d84b2g6 42 25 Hu-1and hu-1and JJ cord-305054-4d84b2g6 42 26 Bat Bat NNP cord-305054-4d84b2g6 42 27 - - HYPH cord-305054-4d84b2g6 42 28 CoV cov NN cord-305054-4d84b2g6 42 29 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 42 30 , , , 
cord-305054-4d84b2g6 42 31 respectively respectively RB cord-305054-4d84b2g6 42 32 . . . cord-305054-4d84b2g6 43 1 All all DT cord-305054-4d84b2g6 43 2 of of IN cord-305054-4d84b2g6 43 3 the the DT cord-305054-4d84b2g6 43 4 pangolins pangolin NNS cord-305054-4d84b2g6 43 5 involved involve VBN cord-305054-4d84b2g6 43 6 in in IN cord-305054-4d84b2g6 43 7 published publish VBN cord-305054-4d84b2g6 43 8 coronavirus coronavirus NN cord-305054-4d84b2g6 43 9 analysis analysis NN cord-305054-4d84b2g6 43 10 were be VBD cord-305054-4d84b2g6 43 11 from from IN cord-305054-4d84b2g6 43 12 either either CC cord-305054-4d84b2g6 43 13 the the DT cord-305054-4d84b2g6 43 14 Guangdong Guangdong NNP cord-305054-4d84b2g6 43 15 collection collection NN cord-305054-4d84b2g6 43 16 or or CC cord-305054-4d84b2g6 43 17 the the DT cord-305054-4d84b2g6 43 18 Guangxi Guangxi NNP cord-305054-4d84b2g6 43 19 collection collection NN cord-305054-4d84b2g6 43 20 . . . cord-305054-4d84b2g6 44 1 Pangolins pangolin NNS cord-305054-4d84b2g6 44 2 from from IN cord-305054-4d84b2g6 44 3 the the DT cord-305054-4d84b2g6 44 4 Guangdong Guangdong NNP cord-305054-4d84b2g6 44 5 collection collection NN cord-305054-4d84b2g6 44 6 were be VBD cord-305054-4d84b2g6 44 7 investigated investigate VBN cord-305054-4d84b2g6 44 8 in in IN cord-305054-4d84b2g6 44 9 most most JJS cord-305054-4d84b2g6 44 10 studies study NNS cord-305054-4d84b2g6 44 11 [ [ -LRB- cord-305054-4d84b2g6 44 12 3 3 CD cord-305054-4d84b2g6 44 13 , , , cord-305054-4d84b2g6 44 14 8 8 CD cord-305054-4d84b2g6 44 15 , , , cord-305054-4d84b2g6 44 16 9 9 CD cord-305054-4d84b2g6 44 17 , , , cord-305054-4d84b2g6 44 18 12 12 CD cord-305054-4d84b2g6 44 19 ] ] -RRB- cord-305054-4d84b2g6 44 20 . . . 
cord-305054-4d84b2g6 45 1 The the DT cord-305054-4d84b2g6 45 2 resulting result VBG cord-305054-4d84b2g6 45 3 viral viral JJ cord-305054-4d84b2g6 45 4 genomes genome NNS cord-305054-4d84b2g6 45 5 were be VBD cord-305054-4d84b2g6 45 6 derived derive VBN cord-305054-4d84b2g6 45 7 from from IN cord-305054-4d84b2g6 45 8 de de NNP cord-305054-4d84b2g6 45 9 novo novo NNP cord-305054-4d84b2g6 45 10 assembly assembly NNP cord-305054-4d84b2g6 45 11 with with IN cord-305054-4d84b2g6 45 12 or or CC cord-305054-4d84b2g6 45 13 without without IN cord-305054-4d84b2g6 45 14 a a DT cord-305054-4d84b2g6 45 15 guided guide VBN cord-305054-4d84b2g6 45 16 reference reference NN cord-305054-4d84b2g6 45 17 and and CC cord-305054-4d84b2g6 45 18 further further RB cord-305054-4d84b2g6 45 19 curated curate VBN cord-305054-4d84b2g6 45 20 using use VBG cord-305054-4d84b2g6 45 21 blast blast NN cord-305054-4d84b2g6 45 22 annotation annotation NN cord-305054-4d84b2g6 45 23 or or CC cord-305054-4d84b2g6 45 24 PCR PCR NNP cord-305054-4d84b2g6 45 25 amplicon amplicon NN cord-305054-4d84b2g6 45 26 sequencing sequencing NN cord-305054-4d84b2g6 45 27 . . . cord-305054-4d84b2g6 46 1 All all DT cord-305054-4d84b2g6 46 2 studies study NNS cord-305054-4d84b2g6 46 3 have have VBP cord-305054-4d84b2g6 46 4 showed show VBN cord-305054-4d84b2g6 46 5 that that IN cord-305054-4d84b2g6 46 6 CoV cov NN cord-305054-4d84b2g6 46 7 in in IN cord-305054-4d84b2g6 46 8 pangolin pangolin NN cord-305054-4d84b2g6 46 9 was be VBD cord-305054-4d84b2g6 46 10 highly highly RB cord-305054-4d84b2g6 46 11 related relate VBN cord-305054-4d84b2g6 46 12 to to IN cord-305054-4d84b2g6 46 13 Bat Bat NNP cord-305054-4d84b2g6 46 14 - - HYPH cord-305054-4d84b2g6 46 15 CoV CoV NNP cord-305054-4d84b2g6 46 16 and and CC cord-305054-4d84b2g6 46 17 SARS SARS NNP cord-305054-4d84b2g6 46 18 - - HYPH cord-305054-4d84b2g6 46 19 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 46 20 . . . 
cord-305054-4d84b2g6 47 1 In in IN cord-305054-4d84b2g6 47 2 all all DT cord-305054-4d84b2g6 47 3 of of IN cord-305054-4d84b2g6 47 4 the the DT cord-305054-4d84b2g6 47 5 studies study NNS cord-305054-4d84b2g6 47 6 that that WDT cord-305054-4d84b2g6 47 7 used use VBD cord-305054-4d84b2g6 47 8 reference reference NN cord-305054-4d84b2g6 47 9 - - HYPH cord-305054-4d84b2g6 47 10 guided guide VBN cord-305054-4d84b2g6 47 11 de de FW cord-305054-4d84b2g6 47 12 novo novo NNP cord-305054-4d84b2g6 47 13 assemblies assembly NNS cord-305054-4d84b2g6 47 14 , , , cord-305054-4d84b2g6 47 15 a a DT cord-305054-4d84b2g6 47 16 SARS SARS NNP cord-305054-4d84b2g6 47 17 - - HYPH cord-305054-4d84b2g6 47 18 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 47 19 genome genome NN cord-305054-4d84b2g6 47 20 ( ( -LRB- cord-305054-4d84b2g6 47 21 Wuhan Wuhan NNP cord-305054-4d84b2g6 47 22 - - HYPH cord-305054-4d84b2g6 47 23 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 47 24 or or CC cord-305054-4d84b2g6 47 25 WIV04 WIV04 NNP cord-305054-4d84b2g6 47 26 ) ) -RRB- cord-305054-4d84b2g6 47 27 was be VBD cord-305054-4d84b2g6 47 28 chosen choose VBN cord-305054-4d84b2g6 47 29 as as IN cord-305054-4d84b2g6 47 30 reference reference NN cord-305054-4d84b2g6 47 31 [ [ -LRB- cord-305054-4d84b2g6 47 32 3 3 CD cord-305054-4d84b2g6 47 33 , , , cord-305054-4d84b2g6 47 34 12 12 CD cord-305054-4d84b2g6 47 35 , , , cord-305054-4d84b2g6 47 36 13 13 CD cord-305054-4d84b2g6 47 37 ] ] -RRB- cord-305054-4d84b2g6 47 38 . . . 
cord-305054-4d84b2g6 48 1 This this DT cord-305054-4d84b2g6 48 2 choice choice NN cord-305054-4d84b2g6 48 3 was be VBD cord-305054-4d84b2g6 48 4 based base VBN cord-305054-4d84b2g6 48 5 on on IN cord-305054-4d84b2g6 48 6 the the DT cord-305054-4d84b2g6 48 7 assumption assumption NN cord-305054-4d84b2g6 48 8 that that IN cord-305054-4d84b2g6 48 9 SARS SARS NNP cord-305054-4d84b2g6 48 10 - - HYPH cord-305054-4d84b2g6 48 11 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 48 12 was be VBD cord-305054-4d84b2g6 48 13 the the DT cord-305054-4d84b2g6 48 14 closest close JJS cord-305054-4d84b2g6 48 15 neighbor neighbor NN cord-305054-4d84b2g6 48 16 of of IN cord-305054-4d84b2g6 48 17 the the DT cord-305054-4d84b2g6 48 18 Pangolin Pangolin NNP cord-305054-4d84b2g6 48 19 CoV cov NN cord-305054-4d84b2g6 48 20 in in IN cord-305054-4d84b2g6 48 21 the the DT cord-305054-4d84b2g6 48 22 phylogeny phylogeny NN cord-305054-4d84b2g6 48 23 tree tree NN cord-305054-4d84b2g6 48 24 . . . cord-305054-4d84b2g6 49 1 However however RB cord-305054-4d84b2g6 49 2 , , , cord-305054-4d84b2g6 49 3 this this DT cord-305054-4d84b2g6 49 4 assumption assumption NN cord-305054-4d84b2g6 49 5 may may MD cord-305054-4d84b2g6 49 6 not not RB cord-305054-4d84b2g6 49 7 necessary necessary JJ cord-305054-4d84b2g6 49 8 be be VB cord-305054-4d84b2g6 49 9 true true JJ cord-305054-4d84b2g6 49 10 . . . 
cord-305054-4d84b2g6 50 1 Therefore therefore RB cord-305054-4d84b2g6 50 2 , , , cord-305054-4d84b2g6 50 3 choosing choose VBG cord-305054-4d84b2g6 50 4 the the DT cord-305054-4d84b2g6 50 5 SARS SARS NNP cord-305054-4d84b2g6 50 6 - - HYPH cord-305054-4d84b2g6 50 7 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 50 8 genome genome NN cord-305054-4d84b2g6 50 9 as as IN cord-305054-4d84b2g6 50 10 reference reference NN cord-305054-4d84b2g6 50 11 could could MD cord-305054-4d84b2g6 50 12 inadvertently inadvertently RB cord-305054-4d84b2g6 50 13 introduce introduce VB cord-305054-4d84b2g6 50 14 bias bias NN cord-305054-4d84b2g6 50 15 in in IN cord-305054-4d84b2g6 50 16 the the DT cord-305054-4d84b2g6 50 17 genome genome NN cord-305054-4d84b2g6 50 18 assembly assembly NN cord-305054-4d84b2g6 50 19 , , , cord-305054-4d84b2g6 50 20 leading lead VBG cord-305054-4d84b2g6 50 21 to to IN cord-305054-4d84b2g6 50 22 inaccurate inaccurate JJ cord-305054-4d84b2g6 50 23 or or CC cord-305054-4d84b2g6 50 24 incomplete incomplete JJ cord-305054-4d84b2g6 50 25 results result NNS cord-305054-4d84b2g6 50 26 . . . cord-305054-4d84b2g6 51 1 In in IN cord-305054-4d84b2g6 51 2 this this DT cord-305054-4d84b2g6 51 3 study study NN cord-305054-4d84b2g6 51 4 , , , cord-305054-4d84b2g6 51 5 we -PRON- PRP cord-305054-4d84b2g6 51 6 assembled assemble VBD cord-305054-4d84b2g6 51 7 the the DT cord-305054-4d84b2g6 51 8 Pangolin Pangolin NNP cord-305054-4d84b2g6 51 9 CoV cov NN cord-305054-4d84b2g6 51 10 genome genome NN cord-305054-4d84b2g6 51 11 using use VBG cord-305054-4d84b2g6 51 12 several several JJ cord-305054-4d84b2g6 51 13 different different JJ cord-305054-4d84b2g6 51 14 genomes genome NNS cord-305054-4d84b2g6 51 15 as as IN cord-305054-4d84b2g6 51 16 reference reference NN cord-305054-4d84b2g6 51 17 . . . 
cord-305054-4d84b2g6 52 1 We -PRON- PRP cord-305054-4d84b2g6 52 2 investigated investigate VBD cord-305054-4d84b2g6 52 3 how how WRB cord-305054-4d84b2g6 52 4 the the DT cord-305054-4d84b2g6 52 5 reference reference NN cord-305054-4d84b2g6 52 6 genome genome NN cord-305054-4d84b2g6 52 7 impact impact NN cord-305054-4d84b2g6 52 8 the the DT cord-305054-4d84b2g6 52 9 resulting result VBG cord-305054-4d84b2g6 52 10 genome genome NN cord-305054-4d84b2g6 52 11 and and CC cord-305054-4d84b2g6 52 12 its -PRON- PRP$ cord-305054-4d84b2g6 52 13 phylogenetic phylogenetic JJ cord-305054-4d84b2g6 52 14 relationship relationship NN cord-305054-4d84b2g6 52 15 with with IN cord-305054-4d84b2g6 52 16 others other NNS cord-305054-4d84b2g6 52 17 . . . cord-305054-4d84b2g6 53 1 The the DT cord-305054-4d84b2g6 53 2 results result NNS cord-305054-4d84b2g6 53 3 from from IN cord-305054-4d84b2g6 53 4 this this DT cord-305054-4d84b2g6 53 5 study study NN cord-305054-4d84b2g6 53 6 will will MD cord-305054-4d84b2g6 53 7 provide provide VB cord-305054-4d84b2g6 53 8 guidance guidance NN cord-305054-4d84b2g6 53 9 for for IN cord-305054-4d84b2g6 53 10 future future JJ cord-305054-4d84b2g6 53 11 studies study NNS cord-305054-4d84b2g6 53 12 on on IN cord-305054-4d84b2g6 53 13 how how WRB cord-305054-4d84b2g6 53 14 to to TO cord-305054-4d84b2g6 53 15 accurately accurately RB cord-305054-4d84b2g6 53 16 construct construct VB cord-305054-4d84b2g6 53 17 CoV cov NN cord-305054-4d84b2g6 53 18 genomes genome NNS cord-305054-4d84b2g6 53 19 from from IN cord-305054-4d84b2g6 53 20 pangolin pangolin NN cord-305054-4d84b2g6 53 21 or or CC cord-305054-4d84b2g6 53 22 other other JJ cord-305054-4d84b2g6 53 23 potential potential JJ cord-305054-4d84b2g6 53 24 intermediate intermediate JJ cord-305054-4d84b2g6 53 25 hosts host NNS cord-305054-4d84b2g6 53 26 . . . 
cord-305054-4d84b2g6 54 1 Two two CD cord-305054-4d84b2g6 54 2 RNA RNA NNP cord-305054-4d84b2g6 54 3 - - HYPH cord-305054-4d84b2g6 54 4 seq seq NN cord-305054-4d84b2g6 54 5 samples sample NNS cord-305054-4d84b2g6 54 6 , , , cord-305054-4d84b2g6 54 7 lung07 lung07 NNP cord-305054-4d84b2g6 54 8 and and CC cord-305054-4d84b2g6 54 9 lung08 lung08 NNP cord-305054-4d84b2g6 54 10 , , , cord-305054-4d84b2g6 54 11 were be VBD cord-305054-4d84b2g6 54 12 downloaded download VBN cord-305054-4d84b2g6 54 13 from from IN cord-305054-4d84b2g6 54 14 NCBI NCBI NNP cord-305054-4d84b2g6 54 15 SRA SRA NNP cord-305054-4d84b2g6 54 16 under under IN cord-305054-4d84b2g6 54 17 BioProject BioProject NNP cord-305054-4d84b2g6 54 18 SRA SRA NNP cord-305054-4d84b2g6 54 19 : : : cord-305054-4d84b2g6 55 1 PRJNA573298 PRJNA573298 NNP cord-305054-4d84b2g6 55 2 . . . cord-305054-4d84b2g6 56 1 The the DT cord-305054-4d84b2g6 56 2 two two CD cord-305054-4d84b2g6 56 3 samples sample NNS cord-305054-4d84b2g6 56 4 were be VBD cord-305054-4d84b2g6 56 5 originally originally RB cord-305054-4d84b2g6 56 6 published publish VBN cord-305054-4d84b2g6 56 7 in in IN cord-305054-4d84b2g6 56 8 [ [ -LRB- cord-305054-4d84b2g6 56 9 8 8 CD cord-305054-4d84b2g6 56 10 ] ] -RRB- cord-305054-4d84b2g6 56 11 for for IN cord-305054-4d84b2g6 56 12 viral viral JJ cord-305054-4d84b2g6 56 13 metagenomics metagenomic NNS cord-305054-4d84b2g6 56 14 analysis analysis NN cord-305054-4d84b2g6 56 15 . . . 
cord-305054-4d84b2g6 57 1 Adaptor adaptor NN cord-305054-4d84b2g6 57 2 trimming trimming NN cord-305054-4d84b2g6 57 3 and and CC cord-305054-4d84b2g6 57 4 quality quality NN cord-305054-4d84b2g6 57 5 control control NN cord-305054-4d84b2g6 57 6 were be VBD cord-305054-4d84b2g6 57 7 performed perform VBN cord-305054-4d84b2g6 57 8 on on IN cord-305054-4d84b2g6 57 9 the the DT cord-305054-4d84b2g6 57 10 raw raw JJ cord-305054-4d84b2g6 57 11 sequence sequence NN cord-305054-4d84b2g6 57 12 reads read VBZ cord-305054-4d84b2g6 57 13 using use VBG cord-305054-4d84b2g6 57 14 with with IN cord-305054-4d84b2g6 57 15 the the DT cord-305054-4d84b2g6 57 16 Trimmomatic Trimmomatic NNP cord-305054-4d84b2g6 57 17 program program NN cord-305054-4d84b2g6 57 18 ( ( -LRB- cord-305054-4d84b2g6 57 19 version version NN cord-305054-4d84b2g6 57 20 0.39 0.39 CD cord-305054-4d84b2g6 57 21 ) ) -RRB- cord-305054-4d84b2g6 58 1 [ [ -LRB- cord-305054-4d84b2g6 58 2 1 1 CD cord-305054-4d84b2g6 58 3 ] ] -RRB- cord-305054-4d84b2g6 58 4 . . . 
cord-305054-4d84b2g6 59 1 To to TO cord-305054-4d84b2g6 59 2 eliminate eliminate VB cord-305054-4d84b2g6 59 3 host host NN cord-305054-4d84b2g6 59 4 contamination contamination NN cord-305054-4d84b2g6 59 5 , , , cord-305054-4d84b2g6 59 6 the the DT cord-305054-4d84b2g6 59 7 remaining remain VBG cord-305054-4d84b2g6 59 8 reads read NNS cord-305054-4d84b2g6 59 9 were be VBD cord-305054-4d84b2g6 59 10 aligned align VBN cord-305054-4d84b2g6 59 11 to to IN cord-305054-4d84b2g6 59 12 the the DT cord-305054-4d84b2g6 59 13 Manis Manis NNP cord-305054-4d84b2g6 59 14 javanica javanica NN cord-305054-4d84b2g6 59 15 genome genome NN cord-305054-4d84b2g6 59 16 ( ( -LRB- cord-305054-4d84b2g6 59 17 SRA SRA NNP cord-305054-4d84b2g6 59 18 : : : cord-305054-4d84b2g6 59 19 PRJNA256023 prjna256023 LS cord-305054-4d84b2g6 59 20 ) ) -RRB- cord-305054-4d84b2g6 59 21 using use VBG cord-305054-4d84b2g6 59 22 BWAaln BWAaln NNP cord-305054-4d84b2g6 59 23 ( ( -LRB- cord-305054-4d84b2g6 59 24 version version NN cord-305054-4d84b2g6 59 25 0.7.17 0.7.17 CD cord-305054-4d84b2g6 59 26 ) ) -RRB- cord-305054-4d84b2g6 59 27 [ [ -LRB- cord-305054-4d84b2g6 59 28 6 6 CD cord-305054-4d84b2g6 59 29 ] ] -RRB- cord-305054-4d84b2g6 59 30 and and CC cord-305054-4d84b2g6 59 31 reads read VBZ cord-305054-4d84b2g6 59 32 mapped map VBN cord-305054-4d84b2g6 59 33 to to IN cord-305054-4d84b2g6 59 34 the the DT cord-305054-4d84b2g6 59 35 host host NN cord-305054-4d84b2g6 59 36 genome genome NN cord-305054-4d84b2g6 59 37 were be VBD cord-305054-4d84b2g6 59 38 discarded discard VBN cord-305054-4d84b2g6 59 39 . . . 
cord-305054-4d84b2g6 60 1 Reads read VBZ cord-305054-4d84b2g6 60 2 unmapped unmapped JJ cord-305054-4d84b2g6 60 3 to to IN cord-305054-4d84b2g6 60 4 the the DT cord-305054-4d84b2g6 60 5 host host NN cord-305054-4d84b2g6 60 6 reference reference NN cord-305054-4d84b2g6 60 7 genome genome NN cord-305054-4d84b2g6 60 8 were be VBD cord-305054-4d84b2g6 60 9 used use VBN cord-305054-4d84b2g6 60 10 to to TO cord-305054-4d84b2g6 60 11 construct construct VB cord-305054-4d84b2g6 60 12 genome genome NN cord-305054-4d84b2g6 60 13 in in IN cord-305054-4d84b2g6 60 14 the the DT cord-305054-4d84b2g6 60 15 subsequent subsequent JJ cord-305054-4d84b2g6 60 16 de de FW cord-305054-4d84b2g6 60 17 novo novo NNP cord-305054-4d84b2g6 60 18 assembly assembly NNP cord-305054-4d84b2g6 60 19 . . . cord-305054-4d84b2g6 60 20 _SP cord-305054-4d84b2g6 61 1 Cleaned clean VBN cord-305054-4d84b2g6 61 2 reads read NNS cord-305054-4d84b2g6 61 3 were be VBD cord-305054-4d84b2g6 61 4 used use VBN cord-305054-4d84b2g6 61 5 to to TO cord-305054-4d84b2g6 61 6 assemble assemble VB cord-305054-4d84b2g6 61 7 genome genome NN cord-305054-4d84b2g6 61 8 using use VBG cord-305054-4d84b2g6 61 9 reference reference NN cord-305054-4d84b2g6 61 10 - - HYPH cord-305054-4d84b2g6 61 11 guided guide VBN cord-305054-4d84b2g6 61 12 de de FW cord-305054-4d84b2g6 61 13 novo novo NNP cord-305054-4d84b2g6 61 14 assembly assembly NNP cord-305054-4d84b2g6 61 15 . . . 
cord-305054-4d84b2g6 62 1 To to TO cord-305054-4d84b2g6 62 2 investigate investigate VB cord-305054-4d84b2g6 62 3 how how WRB cord-305054-4d84b2g6 62 4 the the DT cord-305054-4d84b2g6 62 5 results result NNS cord-305054-4d84b2g6 62 6 were be VBD cord-305054-4d84b2g6 62 7 influenced influence VBN cord-305054-4d84b2g6 62 8 by by IN cord-305054-4d84b2g6 62 9 the the DT cord-305054-4d84b2g6 62 10 choice choice NN cord-305054-4d84b2g6 62 11 of of IN cord-305054-4d84b2g6 62 12 reference reference NN cord-305054-4d84b2g6 62 13 genome genome NN cord-305054-4d84b2g6 62 14 , , , cord-305054-4d84b2g6 62 15 we -PRON- PRP cord-305054-4d84b2g6 62 16 explored explore VBD cord-305054-4d84b2g6 62 17 a a DT cord-305054-4d84b2g6 62 18 few few JJ cord-305054-4d84b2g6 62 19 representative representative JJ cord-305054-4d84b2g6 62 20 virus virus NN cord-305054-4d84b2g6 62 21 genomes genome NNS cord-305054-4d84b2g6 62 22 on on IN cord-305054-4d84b2g6 62 23 the the DT cord-305054-4d84b2g6 62 24 phylogeny phylogeny NN cord-305054-4d84b2g6 62 25 tree tree NN cord-305054-4d84b2g6 62 26 ( ( -LRB- cord-305054-4d84b2g6 62 27 Table table NN cord-305054-4d84b2g6 62 28 1 1 CD cord-305054-4d84b2g6 62 29 ) ) -RRB- cord-305054-4d84b2g6 62 30 as as IN cord-305054-4d84b2g6 62 31 the the DT cord-305054-4d84b2g6 62 32 reference reference NN cord-305054-4d84b2g6 62 33 genome genome NN cord-305054-4d84b2g6 62 34 . . . 
cord-305054-4d84b2g6 63 1 These these DT cord-305054-4d84b2g6 63 2 genomes genome NNS cord-305054-4d84b2g6 63 3 were be VBD cord-305054-4d84b2g6 63 4 selected select VBN cord-305054-4d84b2g6 63 5 based base VBN cord-305054-4d84b2g6 63 6 on on IN cord-305054-4d84b2g6 63 7 previous previous JJ cord-305054-4d84b2g6 63 8 studies study NNS cord-305054-4d84b2g6 63 9 [ [ -LRB- cord-305054-4d84b2g6 63 10 3 3 CD cord-305054-4d84b2g6 63 11 , , , cord-305054-4d84b2g6 63 12 9 9 CD cord-305054-4d84b2g6 63 13 , , , cord-305054-4d84b2g6 63 14 11 11 CD cord-305054-4d84b2g6 63 15 , , , cord-305054-4d84b2g6 63 16 13 13 CD cord-305054-4d84b2g6 63 17 ] ] -RRB- cord-305054-4d84b2g6 63 18 . . . cord-305054-4d84b2g6 64 1 Once once RB cord-305054-4d84b2g6 64 2 a a DT cord-305054-4d84b2g6 64 3 reference reference NN cord-305054-4d84b2g6 64 4 genome genome NN cord-305054-4d84b2g6 64 5 was be VBD cord-305054-4d84b2g6 64 6 picked pick VBN cord-305054-4d84b2g6 64 7 , , , cord-305054-4d84b2g6 64 8 the the DT cord-305054-4d84b2g6 64 9 cleaned clean VBN cord-305054-4d84b2g6 64 10 reads read NNS cord-305054-4d84b2g6 64 11 were be VBD cord-305054-4d84b2g6 64 12 aligned align VBN cord-305054-4d84b2g6 64 13 to to IN cord-305054-4d84b2g6 64 14 the the DT cord-305054-4d84b2g6 64 15 reference reference NN cord-305054-4d84b2g6 64 16 genome genome NN cord-305054-4d84b2g6 64 17 using use VBG cord-305054-4d84b2g6 64 18 BWA BWA NNP cord-305054-4d84b2g6 64 19 - - HYPH cord-305054-4d84b2g6 64 20 MEM MEM NNP cord-305054-4d84b2g6 64 21 [ [ -LRB- cord-305054-4d84b2g6 64 22 5 5 CD cord-305054-4d84b2g6 64 23 ] ] -RRB- cord-305054-4d84b2g6 64 24 , , , cord-305054-4d84b2g6 64 25 and and CC cord-305054-4d84b2g6 64 26 the the DT cord-305054-4d84b2g6 64 27 mapped map VBN cord-305054-4d84b2g6 64 28 reads read NNS cord-305054-4d84b2g6 64 29 were be VBD cord-305054-4d84b2g6 64 30 assembled assemble VBN cord-305054-4d84b2g6 64 31 de de IN cord-305054-4d84b2g6 64 32 novo novo NNP cord-305054-4d84b2g6 64 33 using use VBG 
cord-305054-4d84b2g6 64 34 MEGAHIT MEGAHIT NNP cord-305054-4d84b2g6 64 35 ( ( -LRB- cord-305054-4d84b2g6 64 36 version version NN cord-305054-4d84b2g6 64 37 1.1.3 1.1.3 CD cord-305054-4d84b2g6 64 38 ) ) -RRB- cord-305054-4d84b2g6 64 39 with with IN cord-305054-4d84b2g6 64 40 meta meta JJ cord-305054-4d84b2g6 64 41 - - HYPH cord-305054-4d84b2g6 64 42 sensitive sensitive JJ cord-305054-4d84b2g6 64 43 mode mode NN cord-305054-4d84b2g6 64 44 [ [ -LRB- cord-305054-4d84b2g6 64 45 4 4 CD cord-305054-4d84b2g6 64 46 ] ] -RRB- cord-305054-4d84b2g6 64 47 . . . cord-305054-4d84b2g6 65 1 The the DT cord-305054-4d84b2g6 65 2 resulting result VBG cord-305054-4d84b2g6 65 3 contigs contig NNS cord-305054-4d84b2g6 65 4 were be VBD cord-305054-4d84b2g6 65 5 concatenated concatenate VBN cord-305054-4d84b2g6 65 6 into into IN cord-305054-4d84b2g6 65 7 an an DT cord-305054-4d84b2g6 65 8 assembly assembly NN cord-305054-4d84b2g6 65 9 by by IN cord-305054-4d84b2g6 65 10 aligning align VBG cord-305054-4d84b2g6 65 11 them -PRON- PRP cord-305054-4d84b2g6 65 12 to to IN cord-305054-4d84b2g6 65 13 the the DT cord-305054-4d84b2g6 65 14 reference reference NN cord-305054-4d84b2g6 65 15 genome genome NN cord-305054-4d84b2g6 65 16 . . . cord-305054-4d84b2g6 66 1 Phylogenetic phylogenetic JJ cord-305054-4d84b2g6 66 2 distance distance NN cord-305054-4d84b2g6 66 3 analysis analysis NN cord-305054-4d84b2g6 66 4 was be VBD cord-305054-4d84b2g6 66 5 performed perform VBN cord-305054-4d84b2g6 66 6 using use VBG cord-305054-4d84b2g6 66 7 MEGA MEGA NNP cord-305054-4d84b2g6 66 8 X X NNP cord-305054-4d84b2g6 66 9 ( ( -LRB- cord-305054-4d84b2g6 66 10 version version NN cord-305054-4d84b2g6 66 11 10.1.8 10.1.8 CD cord-305054-4d84b2g6 66 12 ) ) -RRB- cord-305054-4d84b2g6 66 13 [ [ -LRB- cord-305054-4d84b2g6 66 14 2 2 CD cord-305054-4d84b2g6 66 15 ] ] -RRB- cord-305054-4d84b2g6 66 16 .The .The . 
cord-305054-4d84b2g6 67 1 whole whole JJ cord-305054-4d84b2g6 67 2 genome genome NN cord-305054-4d84b2g6 67 3 was be VBD cord-305054-4d84b2g6 67 4 used use VBN cord-305054-4d84b2g6 67 5 in in IN cord-305054-4d84b2g6 67 6 phylogenetic phylogenetic NN cord-305054-4d84b2g6 67 7 and and CC cord-305054-4d84b2g6 67 8 distance distance NN cord-305054-4d84b2g6 67 9 analysis analysis NN cord-305054-4d84b2g6 67 10 , , , cord-305054-4d84b2g6 67 11 and and CC cord-305054-4d84b2g6 67 12 phylogenetic phylogenetic JJ cord-305054-4d84b2g6 67 13 trees tree NNS cord-305054-4d84b2g6 67 14 were be VBD cord-305054-4d84b2g6 67 15 constructed construct VBN cord-305054-4d84b2g6 67 16 in in IN cord-305054-4d84b2g6 67 17 the the DT cord-305054-4d84b2g6 67 18 best good JJS cord-305054-4d84b2g6 67 19 - - HYPH cord-305054-4d84b2g6 67 20 fit fit JJ cord-305054-4d84b2g6 67 21 DNA dna NN cord-305054-4d84b2g6 67 22 / / SYM cord-305054-4d84b2g6 67 23 amino amino NN cord-305054-4d84b2g6 67 24 acid acid NN cord-305054-4d84b2g6 67 25 substitution substitution NN cord-305054-4d84b2g6 67 26 mode mode NN cord-305054-4d84b2g6 67 27 with with IN cord-305054-4d84b2g6 67 28 1000 1000 CD cord-305054-4d84b2g6 67 29 bootstrap bootstrap NN cord-305054-4d84b2g6 67 30 replications replication NNS cord-305054-4d84b2g6 67 31 . . . cord-305054-4d84b2g6 68 1 The the DT cord-305054-4d84b2g6 68 2 whole whole JJ cord-305054-4d84b2g6 68 3 genome genome NN cord-305054-4d84b2g6 68 4 nucleotide nucleotide JJ cord-305054-4d84b2g6 68 5 identity identity NN cord-305054-4d84b2g6 68 6 analysis analysis NN cord-305054-4d84b2g6 68 7 was be VBD cord-305054-4d84b2g6 68 8 performed perform VBN cord-305054-4d84b2g6 68 9 in in IN cord-305054-4d84b2g6 68 10 SimPlot SimPlot NNP cord-305054-4d84b2g6 68 11 3.5.1 3.5.1 CD cord-305054-4d84b2g6 68 12 [ [ -LRB- cord-305054-4d84b2g6 68 13 10 10 CD cord-305054-4d84b2g6 68 14 ] ] -RRB- cord-305054-4d84b2g6 68 15 . . . 
cord-305054-4d84b2g6 69 1 A a DT cord-305054-4d84b2g6 69 2 total total NN cord-305054-4d84b2g6 69 3 of of IN cord-305054-4d84b2g6 69 4 eight eight CD cord-305054-4d84b2g6 69 5 viral viral JJ cord-305054-4d84b2g6 69 6 genomes genome NNS cord-305054-4d84b2g6 69 7 were be VBD cord-305054-4d84b2g6 69 8 tested test VBN cord-305054-4d84b2g6 69 9 as as IN cord-305054-4d84b2g6 69 10 the the DT cord-305054-4d84b2g6 69 11 reference reference NN cord-305054-4d84b2g6 69 12 genome genome NN cord-305054-4d84b2g6 69 13 in in IN cord-305054-4d84b2g6 69 14 reference reference NN cord-305054-4d84b2g6 69 15 - - HYPH cord-305054-4d84b2g6 69 16 guided guide VBN cord-305054-4d84b2g6 69 17 de de IN cord-305054-4d84b2g6 69 18 novo novo NNP cord-305054-4d84b2g6 69 19 assembling assembling NN cord-305054-4d84b2g6 69 20 . . . cord-305054-4d84b2g6 70 1 Numbers number NNS cord-305054-4d84b2g6 70 2 of of IN cord-305054-4d84b2g6 70 3 mapped map VBN cord-305054-4d84b2g6 70 4 reads read VBZ cord-305054-4d84b2g6 70 5 ranged range VBN cord-305054-4d84b2g6 70 6 from from IN cord-305054-4d84b2g6 70 7 2 2 CD cord-305054-4d84b2g6 70 8 to to IN cord-305054-4d84b2g6 70 9 3,060 3,060 CD cord-305054-4d84b2g6 70 10 , , , cord-305054-4d84b2g6 70 11 and and CC cord-305054-4d84b2g6 70 12 length length NN cord-305054-4d84b2g6 70 13 of of IN cord-305054-4d84b2g6 70 14 the the DT cord-305054-4d84b2g6 70 15 resulting result VBG cord-305054-4d84b2g6 70 16 draft draft NN cord-305054-4d84b2g6 70 17 assemblies assembly NNS cord-305054-4d84b2g6 70 18 varied vary VBN cord-305054-4d84b2g6 70 19 from from IN cord-305054-4d84b2g6 70 20 5,969 5,969 CD cord-305054-4d84b2g6 70 21 to to IN cord-305054-4d84b2g6 70 22 22,419 22,419 CD cord-305054-4d84b2g6 70 23 bp bp NNP cord-305054-4d84b2g6 70 24 ( ( -LRB- cord-305054-4d84b2g6 70 25 Table Table NNP cord-305054-4d84b2g6 70 26 2 2 CD cord-305054-4d84b2g6 70 27 ) ) -RRB- cord-305054-4d84b2g6 70 28 . . . 
cord-305054-4d84b2g6 71 1 Two two CD cord-305054-4d84b2g6 71 2 reference reference NN cord-305054-4d84b2g6 71 3 genomes genome NNS cord-305054-4d84b2g6 71 4 , , , cord-305054-4d84b2g6 71 5 MersCoV MersCoV , cord-305054-4d84b2g6 71 6 and and CC cord-305054-4d84b2g6 71 7 Bat Bat NNP cord-305054-4d84b2g6 71 8 Hp Hp NNP cord-305054-4d84b2g6 71 9 - - HYPH cord-305054-4d84b2g6 71 10 BetaCoV BetaCoV NNS cord-305054-4d84b2g6 71 11 Zhejiang2013 Zhejiang2013 NNP cord-305054-4d84b2g6 71 12 , , , cord-305054-4d84b2g6 71 13 failed fail VBD cord-305054-4d84b2g6 71 14 in in IN cord-305054-4d84b2g6 71 15 reads read NNS cord-305054-4d84b2g6 71 16 assembling assemble VBG cord-305054-4d84b2g6 71 17 due due IN cord-305054-4d84b2g6 71 18 to to IN cord-305054-4d84b2g6 71 19 the the DT cord-305054-4d84b2g6 71 20 limited limited JJ cord-305054-4d84b2g6 71 21 number number NN cord-305054-4d84b2g6 71 22 of of IN cord-305054-4d84b2g6 71 23 remaining remain VBG cord-305054-4d84b2g6 71 24 reads read NNS cord-305054-4d84b2g6 71 25 . . . cord-305054-4d84b2g6 72 1 Less Less JJR cord-305054-4d84b2g6 72 2 than than IN cord-305054-4d84b2g6 72 3 1,000 1,000 CD cord-305054-4d84b2g6 72 4 reads read NNS cord-305054-4d84b2g6 72 5 were be VBD cord-305054-4d84b2g6 72 6 mapped map VBN cord-305054-4d84b2g6 72 7 to to IN cord-305054-4d84b2g6 72 8 three three CD cord-305054-4d84b2g6 72 9 reference reference NN cord-305054-4d84b2g6 72 10 genomes genome NNS cord-305054-4d84b2g6 72 11 , , , cord-305054-4d84b2g6 72 12 BJ01 BJ01 NNP cord-305054-4d84b2g6 72 13 , , , cord-305054-4d84b2g6 72 14 BM48_31 BM48_31 NNP cord-305054-4d84b2g6 72 15 , , , cord-305054-4d84b2g6 72 16 and and CC cord-305054-4d84b2g6 72 17 Longquan140 Longquan140 NNP cord-305054-4d84b2g6 72 18 , , , cord-305054-4d84b2g6 72 19 resulting result VBG cord-305054-4d84b2g6 72 20 in in IN cord-305054-4d84b2g6 72 21 shorter short JJR cord-305054-4d84b2g6 72 22 assemblies assembly NNS cord-305054-4d84b2g6 72 23 . . . 
cord-305054-4d84b2g6 73 1 A a DT cord-305054-4d84b2g6 73 2 total total NN cord-305054-4d84b2g6 73 3 of of IN cord-305054-4d84b2g6 73 4 1,061 1,061 CD cord-305054-4d84b2g6 73 5 reads read NNS cord-305054-4d84b2g6 73 6 were be VBD cord-305054-4d84b2g6 73 7 mapped map VBN cord-305054-4d84b2g6 73 8 to to IN cord-305054-4d84b2g6 73 9 ZC45 ZC45 NNP cord-305054-4d84b2g6 73 10 and and CC cord-305054-4d84b2g6 73 11 subsequently subsequently RB cord-305054-4d84b2g6 73 12 assembled assemble VBD cord-305054-4d84b2g6 73 13 into into IN cord-305054-4d84b2g6 73 14 a a DT cord-305054-4d84b2g6 73 15 21,819-bp 21,819-bp CD cord-305054-4d84b2g6 73 16 assembly assembly NN cord-305054-4d84b2g6 73 17 with with IN cord-305054-4d84b2g6 73 18 67.8 67.8 CD cord-305054-4d84b2g6 73 19 % % NN cord-305054-4d84b2g6 73 20 of of IN cord-305054-4d84b2g6 73 21 coverage coverage NN cord-305054-4d84b2g6 73 22 . . . cord-305054-4d84b2g6 74 1 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 74 2 , , , cord-305054-4d84b2g6 74 3 which which WDT cord-305054-4d84b2g6 74 4 is be VBZ cord-305054-4d84b2g6 74 5 a a DT cord-305054-4d84b2g6 74 6 bat bat NN cord-305054-4d84b2g6 74 7 CoV cov NN cord-305054-4d84b2g6 74 8 , , , cord-305054-4d84b2g6 74 9 had have VBD cord-305054-4d84b2g6 74 10 1,287 1,287 CD cord-305054-4d84b2g6 74 11 reads read NNS cord-305054-4d84b2g6 74 12 mapped map VBN cord-305054-4d84b2g6 74 13 to to IN cord-305054-4d84b2g6 74 14 it -PRON- PRP cord-305054-4d84b2g6 74 15 , , , cord-305054-4d84b2g6 74 16 and and CC cord-305054-4d84b2g6 74 17 the the DT cord-305054-4d84b2g6 74 18 resulting result VBG cord-305054-4d84b2g6 74 19 assembly assembly NN cord-305054-4d84b2g6 74 20 has have VBZ cord-305054-4d84b2g6 74 21 total total JJ cord-305054-4d84b2g6 74 22 length length NN cord-305054-4d84b2g6 74 23 of of IN cord-305054-4d84b2g6 74 24 21,925 21,925 CD cord-305054-4d84b2g6 74 25 and and CC cord-305054-4d84b2g6 74 26 N50 N50 NNP cord-305054-4d84b2g6 74 27 of of IN cord-305054-4d84b2g6 74 28 1,428 1,428 CD 
cord-305054-4d84b2g6 74 29 . . . cord-305054-4d84b2g6 75 1 A a DT cord-305054-4d84b2g6 75 2 total total NN cord-305054-4d84b2g6 75 3 of of IN cord-305054-4d84b2g6 75 4 3,060 3,060 CD cord-305054-4d84b2g6 75 5 reads read NNS cord-305054-4d84b2g6 75 6 were be VBD cord-305054-4d84b2g6 75 7 mapped map VBN cord-305054-4d84b2g6 75 8 to to IN cord-305054-4d84b2g6 75 9 Wuhan Wuhan NNP cord-305054-4d84b2g6 75 10 - - HYPH cord-305054-4d84b2g6 75 11 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 75 12 , , , cord-305054-4d84b2g6 75 13 a a DT cord-305054-4d84b2g6 75 14 human human JJ cord-305054-4d84b2g6 75 15 SARS SARS NNP cord-305054-4d84b2g6 75 16 - - HYPH cord-305054-4d84b2g6 75 17 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 75 18 strain strain NN cord-305054-4d84b2g6 75 19 , , , cord-305054-4d84b2g6 75 20 which which WDT cord-305054-4d84b2g6 75 21 was be VBD cord-305054-4d84b2g6 75 22 the the DT cord-305054-4d84b2g6 75 23 highest high JJS cord-305054-4d84b2g6 75 24 number number NN cord-305054-4d84b2g6 75 25 among among IN cord-305054-4d84b2g6 75 26 all all DT cord-305054-4d84b2g6 75 27 genomes genome NNS cord-305054-4d84b2g6 75 28 we -PRON- PRP cord-305054-4d84b2g6 75 29 surveyed survey VBD cord-305054-4d84b2g6 75 30 . . . cord-305054-4d84b2g6 76 1 The the DT cord-305054-4d84b2g6 76 2 assembly assembly NN cord-305054-4d84b2g6 76 3 guided guide VBN cord-305054-4d84b2g6 76 4 by by IN cord-305054-4d84b2g6 76 5 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 76 6 and and CC cord-305054-4d84b2g6 76 7 Wuhan Wuhan NNP cord-305054-4d84b2g6 76 8 - - HYPH cord-305054-4d84b2g6 76 9 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 76 10 showed show VBD cord-305054-4d84b2g6 76 11 similar similar JJ cord-305054-4d84b2g6 76 12 coverage coverage NN cord-305054-4d84b2g6 76 13 at at IN cord-305054-4d84b2g6 76 14 about about RB cord-305054-4d84b2g6 76 15 eighty eighty CD cord-305054-4d84b2g6 76 16 percent percent NN cord-305054-4d84b2g6 76 17 . . . 
cord-305054-4d84b2g6 77 1 However however RB cord-305054-4d84b2g6 77 2 , , , cord-305054-4d84b2g6 77 3 the the DT cord-305054-4d84b2g6 77 4 resulting result VBG cord-305054-4d84b2g6 77 5 assembly assembly NN cord-305054-4d84b2g6 77 6 had have VBD cord-305054-4d84b2g6 77 7 shorter short JJR cord-305054-4d84b2g6 77 8 total total JJ cord-305054-4d84b2g6 77 9 length length NN cord-305054-4d84b2g6 77 10 ( ( -LRB- cord-305054-4d84b2g6 77 11 21,819 21,819 CD cord-305054-4d84b2g6 77 12 ) ) -RRB- cord-305054-4d84b2g6 77 13 and and CC cord-305054-4d84b2g6 77 14 N50 N50 NNP cord-305054-4d84b2g6 77 15 ( ( -LRB- cord-305054-4d84b2g6 77 16 1 1 CD cord-305054-4d84b2g6 77 17 , , , cord-305054-4d84b2g6 77 18 195 195 CD cord-305054-4d84b2g6 77 19 ) ) -RRB- cord-305054-4d84b2g6 77 20 than than IN cord-305054-4d84b2g6 77 21 those those DT cord-305054-4d84b2g6 77 22 of of IN cord-305054-4d84b2g6 77 23 the the DT cord-305054-4d84b2g6 77 24 assembly assembly NN cord-305054-4d84b2g6 77 25 that that WDT cord-305054-4d84b2g6 77 26 used use VBD cord-305054-4d84b2g6 77 27 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 77 28 as as IN cord-305054-4d84b2g6 77 29 reference reference NN cord-305054-4d84b2g6 77 30 . . . 
cord-305054-4d84b2g6 78 1 To to TO cord-305054-4d84b2g6 78 2 understand understand VB cord-305054-4d84b2g6 78 3 this this DT cord-305054-4d84b2g6 78 4 seeming seeming JJ cord-305054-4d84b2g6 78 5 contradiction contradiction NN cord-305054-4d84b2g6 78 6 , , , cord-305054-4d84b2g6 78 7 we -PRON- PRP cord-305054-4d84b2g6 78 8 investigated investigate VBD cord-305054-4d84b2g6 78 9 the the DT cord-305054-4d84b2g6 78 10 reads read NNS cord-305054-4d84b2g6 78 11 coverage coverage NN cord-305054-4d84b2g6 78 12 and and CC cord-305054-4d84b2g6 78 13 depth depth NN cord-305054-4d84b2g6 78 14 on on IN cord-305054-4d84b2g6 78 15 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 78 16 and and CC cord-305054-4d84b2g6 78 17 Wuhan Wuhan NNP cord-305054-4d84b2g6 78 18 - - HYPH cord-305054-4d84b2g6 78 19 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 78 20 . . . cord-305054-4d84b2g6 79 1 The the DT cord-305054-4d84b2g6 79 2 results result NNS cord-305054-4d84b2g6 79 3 ( ( -LRB- cord-305054-4d84b2g6 79 4 Figure figure NN cord-305054-4d84b2g6 79 5 1 1 CD cord-305054-4d84b2g6 79 6 ) ) -RRB- cord-305054-4d84b2g6 79 7 show show VBP cord-305054-4d84b2g6 79 8 that that IN cord-305054-4d84b2g6 79 9 there there EX cord-305054-4d84b2g6 79 10 were be VBD cord-305054-4d84b2g6 79 11 an an DT cord-305054-4d84b2g6 79 12 excessive excessive JJ cord-305054-4d84b2g6 79 13 number number NN cord-305054-4d84b2g6 79 14 of of IN cord-305054-4d84b2g6 79 15 reads read NNS cord-305054-4d84b2g6 79 16 mapped map VBN cord-305054-4d84b2g6 79 17 to to IN cord-305054-4d84b2g6 79 18 distal distal JJ cord-305054-4d84b2g6 79 19 regions region NNS cord-305054-4d84b2g6 79 20 of of IN cord-305054-4d84b2g6 79 21 Wuhan Wuhan NNP cord-305054-4d84b2g6 79 22 - - HYPH cord-305054-4d84b2g6 79 23 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 79 24 , , , cord-305054-4d84b2g6 79 25 which which WDT cord-305054-4d84b2g6 79 26 could could MD cord-305054-4d84b2g6 79 27 indicate indicate VB cord-305054-4d84b2g6 79 28 artifacts artifact NNS cord-305054-4d84b2g6 79 29 or
or CC cord-305054-4d84b2g6 79 30 contaminations contamination NNS cord-305054-4d84b2g6 79 31 during during IN cord-305054-4d84b2g6 79 32 the the DT cord-305054-4d84b2g6 79 33 sequencing sequencing NN cord-305054-4d84b2g6 79 34 . . . cord-305054-4d84b2g6 80 1 After after IN cord-305054-4d84b2g6 80 2 removing remove VBG cord-305054-4d84b2g6 80 3 these these DT cord-305054-4d84b2g6 80 4 tail tail NN cord-305054-4d84b2g6 80 5 regions region NNS cord-305054-4d84b2g6 80 6 ( ( -LRB- cord-305054-4d84b2g6 80 7 with with IN cord-305054-4d84b2g6 80 8 > > XX cord-305054-4d84b2g6 80 9 200X 200x CD cord-305054-4d84b2g6 80 10 depth depth NN cord-305054-4d84b2g6 80 11 ) ) -RRB- cord-305054-4d84b2g6 80 12 , , , cord-305054-4d84b2g6 80 13 62 62 CD cord-305054-4d84b2g6 80 14 more more JJR cord-305054-4d84b2g6 80 15 unique unique JJ cord-305054-4d84b2g6 80 16 reads read NNS cord-305054-4d84b2g6 80 17 were be VBD cord-305054-4d84b2g6 80 18 mapped map VBN cord-305054-4d84b2g6 80 19 to to IN cord-305054-4d84b2g6 80 20 Wuhan Wuhan NNP cord-305054-4d84b2g6 80 21 - - HYPH cord-305054-4d84b2g6 80 22 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 80 23 . . . cord-305054-4d84b2g6 81 1 Figure figure NN cord-305054-4d84b2g6 81 2 2 2 CD cord-305054-4d84b2g6 81 3 shows show VBZ cord-305054-4d84b2g6 81 4 the the DT cord-305054-4d84b2g6 81 5 overlap overlap NN cord-305054-4d84b2g6 81 6 of of IN cord-305054-4d84b2g6 81 7 between between IN cord-305054-4d84b2g6 81 8 the the DT cord-305054-4d84b2g6 81 9 unique unique JJ cord-305054-4d84b2g6 81 10 reads read NNS cord-305054-4d84b2g6 81 11 mapped map VBN cord-305054-4d84b2g6 81 12 to to IN cord-305054-4d84b2g6 81 13 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 81 14 and and CC cord-305054-4d84b2g6 81 15 Wuhan Wuhan NNP cord-305054-4d84b2g6 81 16 - - HYPH cord-305054-4d84b2g6 81 17 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 81 18 . . . 
cord-305054-4d84b2g6 82 1 unique unique JJ cord-305054-4d84b2g6 82 2 reads read NNS cord-305054-4d84b2g6 82 3 were be VBD cord-305054-4d84b2g6 82 4 used use VBN cord-305054-4d84b2g6 82 5 to to TO cord-305054-4d84b2g6 82 6 construct construct VB cord-305054-4d84b2g6 82 7 Venn Venn NNP cord-305054-4d84b2g6 82 8 diagrams diagram NNS cord-305054-4d84b2g6 82 9 . . . cord-305054-4d84b2g6 83 1 All all DT cord-305054-4d84b2g6 83 2 reads read VBZ cord-305054-4d84b2g6 83 3 mapped map VBN cord-305054-4d84b2g6 83 4 to to IN cord-305054-4d84b2g6 83 5 Italy Italy NNP cord-305054-4d84b2g6 83 6 strain strain NN cord-305054-4d84b2g6 83 7 were be VBD cord-305054-4d84b2g6 83 8 also also RB cord-305054-4d84b2g6 83 9 mapped map VBN cord-305054-4d84b2g6 83 10 to to IN cord-305054-4d84b2g6 83 11 Wuhan Wuhan NNP cord-305054-4d84b2g6 83 12 - - HYPH cord-305054-4d84b2g6 83 13 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 83 14 genome genome NN cord-305054-4d84b2g6 84 1 ( ( -LRB- cord-305054-4d84b2g6 84 2 Figure figure NN cord-305054-4d84b2g6 84 3 2 2 CD cord-305054-4d84b2g6 84 4 ) ) -RRB- cord-305054-4d84b2g6 84 5 . . . cord-305054-4d84b2g6 85 1 The the DT cord-305054-4d84b2g6 85 2 additional additional JJ cord-305054-4d84b2g6 85 3 1516 1516 CD cord-305054-4d84b2g6 85 4 unique unique JJ cord-305054-4d84b2g6 85 5 reads read NNS cord-305054-4d84b2g6 85 6 were be VBD cord-305054-4d84b2g6 85 7 mapped map VBN cord-305054-4d84b2g6 85 8 to to IN cord-305054-4d84b2g6 85 9 Wuhan Wuhan NNP cord-305054-4d84b2g6 85 10 - - HYPH cord-305054-4d84b2g6 85 11 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 85 12 . . . 
cord-305054-4d84b2g6 86 1 Most Most JJS cord-305054-4d84b2g6 86 2 reads read NNS cord-305054-4d84b2g6 86 3 were be VBD cord-305054-4d84b2g6 86 4 mapped map VBN cord-305054-4d84b2g6 86 5 to to IN cord-305054-4d84b2g6 86 6 both both DT cord-305054-4d84b2g6 86 7 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 86 8 and and CC cord-305054-4d84b2g6 86 9 Italy Italy NNP cord-305054-4d84b2g6 86 10 strain strain VBP cord-305054-4d84b2g6 86 11 , , , cord-305054-4d84b2g6 86 12 but but CC cord-305054-4d84b2g6 86 13 a a DT cord-305054-4d84b2g6 86 14 total total NN cord-305054-4d84b2g6 86 15 of of IN cord-305054-4d84b2g6 86 16 45 45 CD cord-305054-4d84b2g6 86 17 reads read NNS cord-305054-4d84b2g6 86 18 were be VBD cord-305054-4d84b2g6 86 19 aligned align VBN cord-305054-4d84b2g6 86 20 to to IN cord-305054-4d84b2g6 86 21 either either DT cord-305054-4d84b2g6 86 22 one one NN cord-305054-4d84b2g6 86 23 . . . cord-305054-4d84b2g6 87 1 Bat bat NN cord-305054-4d84b2g6 87 2 - - HYPH cord-305054-4d84b2g6 87 3 CoV CoV NNP cord-305054-4d84b2g6 87 4 , , , cord-305054-4d84b2g6 87 5 ZC45 ZC45 NNP cord-305054-4d84b2g6 87 6 and and CC cord-305054-4d84b2g6 87 7 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 87 8 , , , cord-305054-4d84b2g6 87 9 shared share VBD cord-305054-4d84b2g6 87 10 999 999 CD cord-305054-4d84b2g6 87 11 reads read NNS cord-305054-4d84b2g6 87 12 in in IN cord-305054-4d84b2g6 87 13 alignment alignment NN cord-305054-4d84b2g6 87 14 , , , cord-305054-4d84b2g6 87 15 and and CC cord-305054-4d84b2g6 87 16 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 87 17 genome genome NN cord-305054-4d84b2g6 87 18 guided guide VBN cord-305054-4d84b2g6 87 19 over over IN cord-305054-4d84b2g6 87 20 two two CD cord-305054-4d84b2g6 87 21 hundred hundred CD cord-305054-4d84b2g6 87 22 more more JJR cord-305054-4d84b2g6 87 23 reads read NNS cord-305054-4d84b2g6 87 24 into into IN cord-305054-4d84b2g6 87 25 assembly assembly NN cord-305054-4d84b2g6 87 26 . . . 
cord-305054-4d84b2g6 88 1 The the DT cord-305054-4d84b2g6 88 2 two two CD cord-305054-4d84b2g6 88 3 assemblies assembly NNS cord-305054-4d84b2g6 88 4 that that WDT cord-305054-4d84b2g6 88 5 used use VBD cord-305054-4d84b2g6 88 6 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 88 7 and and CC cord-305054-4d84b2g6 88 8 Wuhan Wuhan NNP cord-305054-4d84b2g6 88 9 - - HYPH cord-305054-4d84b2g6 88 10 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 88 11 as as IN cord-305054-4d84b2g6 88 12 reference reference NN cord-305054-4d84b2g6 88 13 genomes genome NNS cord-305054-4d84b2g6 88 14 respectively respectively RB cord-305054-4d84b2g6 88 15 , , , cord-305054-4d84b2g6 88 16 were be VBD cord-305054-4d84b2g6 88 17 aligned align VBN cord-305054-4d84b2g6 88 18 with with IN cord-305054-4d84b2g6 88 19 the the DT cord-305054-4d84b2g6 88 20 eight eight CD cord-305054-4d84b2g6 88 21 reference reference NN cord-305054-4d84b2g6 88 22 viral viral JJ cord-305054-4d84b2g6 88 23 genomes genome NNS cord-305054-4d84b2g6 88 24 in in IN cord-305054-4d84b2g6 88 25 Table Table NNP cord-305054-4d84b2g6 88 26 1 1 CD cord-305054-4d84b2g6 88 27 in in IN cord-305054-4d84b2g6 88 28 a a DT cord-305054-4d84b2g6 88 29 multiple multiple JJ cord-305054-4d84b2g6 88 30 alignment alignment NN cord-305054-4d84b2g6 88 31 . . . cord-305054-4d84b2g6 89 1 The the DT cord-305054-4d84b2g6 89 2 multiple multiple JJ cord-305054-4d84b2g6 89 3 alignment alignment NN cord-305054-4d84b2g6 89 4 result result NN cord-305054-4d84b2g6 89 5 was be VBD cord-305054-4d84b2g6 89 6 used use VBN cord-305054-4d84b2g6 89 7 for for IN cord-305054-4d84b2g6 89 8 similarity similarity NN cord-305054-4d84b2g6 89 9 analysis analysis NN cord-305054-4d84b2g6 89 10 and and CC cord-305054-4d84b2g6 89 11 phylogenetic phylogenetic NN cord-305054-4d84b2g6 89 12 analysis analysis NN cord-305054-4d84b2g6 89 13 . . . 
cord-305054-4d84b2g6 90 1 In in IN cord-305054-4d84b2g6 90 2 terms term NNS cord-305054-4d84b2g6 90 3 of of IN cord-305054-4d84b2g6 90 4 the the DT cord-305054-4d84b2g6 90 5 whole whole JJ cord-305054-4d84b2g6 90 6 genome genome NN cord-305054-4d84b2g6 90 7 nucleotide nucleotide NN cord-305054-4d84b2g6 90 8 , , , cord-305054-4d84b2g6 90 9 the the DT cord-305054-4d84b2g6 90 10 RaTG13-guided RaTG13-guided NNP cord-305054-4d84b2g6 90 11 assembly assembly NN cord-305054-4d84b2g6 90 12 showed show VBD cord-305054-4d84b2g6 90 13 85.8 85.8 CD cord-305054-4d84b2g6 90 14 % % NN cord-305054-4d84b2g6 90 15 and and CC cord-305054-4d84b2g6 90 16 85.2 85.2 CD cord-305054-4d84b2g6 90 17 % % NN cord-305054-4d84b2g6 90 18 identity identity NN cord-305054-4d84b2g6 90 19 to to IN cord-305054-4d84b2g6 90 20 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 90 21 and and CC cord-305054-4d84b2g6 90 22 Wuhan Wuhan NNP cord-305054-4d84b2g6 90 23 - - HYPH cord-305054-4d84b2g6 90 24 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 90 25 , , , cord-305054-4d84b2g6 90 26 respectively respectively RB cord-305054-4d84b2g6 90 27 , , , cord-305054-4d84b2g6 90 28 and and CC cord-305054-4d84b2g6 90 29 the the DT cord-305054-4d84b2g6 90 30 Wuhan Wuhan NNP cord-305054-4d84b2g6 90 31 - - HYPH cord-305054-4d84b2g6 90 32 Hu-1-guided hu-1-guide VBN cord-305054-4d84b2g6 90 33 assembly assembly NNP cord-305054-4d84b2g6 90 34 showed show VBD cord-305054-4d84b2g6 90 35 88.6 88.6 CD cord-305054-4d84b2g6 90 36 and and CC cord-305054-4d84b2g6 90 37 88.8 88.8 CD cord-305054-4d84b2g6 90 38 % % NN cord-305054-4d84b2g6 90 39 identity identity NN cord-305054-4d84b2g6 90 40 to to IN cord-305054-4d84b2g6 90 41 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 90 42 and and CC cord-305054-4d84b2g6 90 43 Wuhan Wuhan NNP cord-305054-4d84b2g6 90 44 - - HYPH cord-305054-4d84b2g6 90 45 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 90 46 respectively respectively RB cord-305054-4d84b2g6 90 47 . . . 
cord-305054-4d84b2g6 91 1 The the DT cord-305054-4d84b2g6 91 2 difference difference NN cord-305054-4d84b2g6 91 3 between between IN cord-305054-4d84b2g6 91 4 Wuhan Wuhan NNP cord-305054-4d84b2g6 91 5 - - HYPH cord-305054-4d84b2g6 91 6 Hu-1-guided hu-1-guide VBN cord-305054-4d84b2g6 91 7 assembly assembly NN cord-305054-4d84b2g6 91 8 and and CC cord-305054-4d84b2g6 91 9 RaTG13-guided RaTG13-guided NNP cord-305054-4d84b2g6 91 10 assembly assembly NN cord-305054-4d84b2g6 91 11 was be VBD cord-305054-4d84b2g6 91 12 not not RB cord-305054-4d84b2g6 91 13 always always RB cord-305054-4d84b2g6 91 14 consistent consistent JJ cord-305054-4d84b2g6 91 15 with with IN cord-305054-4d84b2g6 91 16 difference difference NN cord-305054-4d84b2g6 91 17 between between IN cord-305054-4d84b2g6 91 18 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 91 19 and and CC cord-305054-4d84b2g6 91 20 Wuhan Wuhan NNP cord-305054-4d84b2g6 91 21 - - HYPH cord-305054-4d84b2g6 91 22 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 91 23 ( ( -LRB- cord-305054-4d84b2g6 91 24 Table Table NNP cord-305054-4d84b2g6 91 25 S1 S1 NNP cord-305054-4d84b2g6 91 26 ) ) -RRB- cord-305054-4d84b2g6 91 27 . . . 
cord-305054-4d84b2g6 92 1 The the DT cord-305054-4d84b2g6 92 2 differences difference NNS cord-305054-4d84b2g6 92 3 between between IN cord-305054-4d84b2g6 92 4 RaTG13-guided RaTG13-guided NNP cord-305054-4d84b2g6 92 5 and and CC cord-305054-4d84b2g6 92 6 Wuhan Wuhan NNP cord-305054-4d84b2g6 92 7 - - HYPH cord-305054-4d84b2g6 92 8 Hu-1-guided hu-1-guide VBN cord-305054-4d84b2g6 92 9 assemblies assembly NNS cord-305054-4d84b2g6 92 10 in in IN cord-305054-4d84b2g6 92 11 the the DT cord-305054-4d84b2g6 92 12 regions region NNS cord-305054-4d84b2g6 92 13 of of IN cord-305054-4d84b2g6 92 14 18,431 18,431 CD cord-305054-4d84b2g6 92 15 - - SYM cord-305054-4d84b2g6 92 16 18,601 18,601 CD cord-305054-4d84b2g6 92 17 bp bp NNP cord-305054-4d84b2g6 92 18 were be VBD cord-305054-4d84b2g6 92 19 probably probably RB cord-305054-4d84b2g6 92 20 due due JJ cord-305054-4d84b2g6 92 21 to to IN cord-305054-4d84b2g6 92 22 differences difference NNS cord-305054-4d84b2g6 92 23 between between IN cord-305054-4d84b2g6 92 24 the the DT cord-305054-4d84b2g6 92 25 reference reference NN cord-305054-4d84b2g6 92 26 genomes genome NNS cord-305054-4d84b2g6 92 27 ( ( -LRB- cord-305054-4d84b2g6 92 28 Table Table NNP cord-305054-4d84b2g6 92 29 S1 S1 NNP cord-305054-4d84b2g6 92 30 ) ) -RRB- cord-305054-4d84b2g6 92 31 . . . 
cord-305054-4d84b2g6 93 1 However however RB cord-305054-4d84b2g6 93 2 , , , cord-305054-4d84b2g6 93 3 the the DT cord-305054-4d84b2g6 93 4 differences difference NNS cord-305054-4d84b2g6 93 5 in in IN cord-305054-4d84b2g6 93 6 some some DT cord-305054-4d84b2g6 93 7 regions region NNS cord-305054-4d84b2g6 93 8 including include VBG cord-305054-4d84b2g6 93 9 4,761 4,761 CD cord-305054-4d84b2g6 93 10 - - SYM cord-305054-4d84b2g6 93 11 5,021 5,021 CD cord-305054-4d84b2g6 93 12 bp bp NNP cord-305054-4d84b2g6 93 13 and and CC cord-305054-4d84b2g6 93 14 10,121 10,121 CD cord-305054-4d84b2g6 93 15 - - SYM cord-305054-4d84b2g6 93 16 11,321 11,321 CD cord-305054-4d84b2g6 93 17 bp bp NNP cord-305054-4d84b2g6 93 18 , , , cord-305054-4d84b2g6 93 19 were be VBD cord-305054-4d84b2g6 93 20 not not RB cord-305054-4d84b2g6 93 21 due due JJ cord-305054-4d84b2g6 93 22 to to IN cord-305054-4d84b2g6 93 23 differences difference NNS cord-305054-4d84b2g6 93 24 in in IN cord-305054-4d84b2g6 93 25 the the DT cord-305054-4d84b2g6 93 26 reference reference NN cord-305054-4d84b2g6 93 27 genomes genome NNS cord-305054-4d84b2g6 93 28 . . . 
cord-305054-4d84b2g6 94 1 We -PRON- PRP cord-305054-4d84b2g6 94 2 further further RB cord-305054-4d84b2g6 94 3 investigated investigate VBD cord-305054-4d84b2g6 94 4 phylogenetic phylogenetic JJ cord-305054-4d84b2g6 94 5 relationship relationship NN cord-305054-4d84b2g6 94 6 between between IN cord-305054-4d84b2g6 94 7 the the DT cord-305054-4d84b2g6 94 8 two two CD cord-305054-4d84b2g6 94 9 assemblies assembly NNS cord-305054-4d84b2g6 94 10 and and CC cord-305054-4d84b2g6 94 11 eight eight CD cord-305054-4d84b2g6 94 12 reference reference NN cord-305054-4d84b2g6 94 13 viral viral JJ cord-305054-4d84b2g6 94 14 genomes genome NNS cord-305054-4d84b2g6 94 15 using use VBG cord-305054-4d84b2g6 94 16 MEGA MEGA NNP cord-305054-4d84b2g6 94 17 X x NN cord-305054-4d84b2g6 94 18 with with IN cord-305054-4d84b2g6 94 19 1000 1000 CD cord-305054-4d84b2g6 94 20 Bootstrap Bootstrap NNP cord-305054-4d84b2g6 94 21 tests test NNS cord-305054-4d84b2g6 94 22 . . . cord-305054-4d84b2g6 95 1 The the DT cord-305054-4d84b2g6 95 2 two two CD cord-305054-4d84b2g6 95 3 assemblies assembly NNS cord-305054-4d84b2g6 95 4 positioned position VBD cord-305054-4d84b2g6 95 5 between between IN cord-305054-4d84b2g6 95 6 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 95 7 and and CC cord-305054-4d84b2g6 95 8 ZC45 ZC45 NNP cord-305054-4d84b2g6 95 9 with with IN cord-305054-4d84b2g6 95 10 strong strong JJ cord-305054-4d84b2g6 95 11 statistical statistical JJ cord-305054-4d84b2g6 95 12 evidence evidence NN cord-305054-4d84b2g6 95 13 ( ( -LRB- cord-305054-4d84b2g6 95 14 Figure figure NN cord-305054-4d84b2g6 95 15 3 3 CD cord-305054-4d84b2g6 95 16 ) ) -RRB- cord-305054-4d84b2g6 95 17 . . . 
cord-305054-4d84b2g6 96 1 The the DT cord-305054-4d84b2g6 96 2 SARS SARS NNP cord-305054-4d84b2g6 96 3 - - HYPH cord-305054-4d84b2g6 96 4 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 96 5 genome genome NN cord-305054-4d84b2g6 96 6 , , , cord-305054-4d84b2g6 96 7 Wuhan Wuhan NNP cord-305054-4d84b2g6 96 8 - - HYPH cord-305054-4d84b2g6 96 9 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 96 10 , , , cord-305054-4d84b2g6 96 11 and and CC cord-305054-4d84b2g6 96 12 Bat Bat NNP cord-305054-4d84b2g6 96 13 - - HYPH cord-305054-4d84b2g6 96 14 CoV CoV NNP cord-305054-4d84b2g6 96 15 , , , cord-305054-4d84b2g6 96 16 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 96 17 , , , cord-305054-4d84b2g6 96 18 clustered cluster VBN cord-305054-4d84b2g6 96 19 closely closely RB cord-305054-4d84b2g6 96 20 together together RB cord-305054-4d84b2g6 96 21 . . . cord-305054-4d84b2g6 97 1 The the DT cord-305054-4d84b2g6 97 2 two two CD cord-305054-4d84b2g6 97 3 assemblies assembly NNS cord-305054-4d84b2g6 97 4 from from IN cord-305054-4d84b2g6 97 5 this this DT cord-305054-4d84b2g6 97 6 study study NN cord-305054-4d84b2g6 97 7 consistently consistently RB cord-305054-4d84b2g6 97 8 positioned position VBD cord-305054-4d84b2g6 97 9 between between IN cord-305054-4d84b2g6 97 10 this this DT cord-305054-4d84b2g6 97 11 cluster cluster NN cord-305054-4d84b2g6 97 12 and and CC cord-305054-4d84b2g6 97 13 other other JJ cord-305054-4d84b2g6 97 14 CoVs covs NN cord-305054-4d84b2g6 97 15 ( ( -LRB- cord-305054-4d84b2g6 97 16 Figure figure NN cord-305054-4d84b2g6 97 17 _SP cord-305054-4d84b2g6 97 18 4 4 CD cord-305054-4d84b2g6 97 19 ) ) -RRB- cord-305054-4d84b2g6 97 20 . . . 
cord-305054-4d84b2g6 98 1 Among among IN cord-305054-4d84b2g6 98 2 other other JJ cord-305054-4d84b2g6 98 3 CoVs CoVs NNP cord-305054-4d84b2g6 98 4 , , , cord-305054-4d84b2g6 98 5 ZC45 ZC45 NNP cord-305054-4d84b2g6 98 6 is be VBZ cord-305054-4d84b2g6 98 7 the the DT cord-305054-4d84b2g6 98 8 closest close JJS cord-305054-4d84b2g6 98 9 neighbor neighbor NN cord-305054-4d84b2g6 98 10 to to IN cord-305054-4d84b2g6 98 11 the the DT cord-305054-4d84b2g6 98 12 assemblies assembly NNS cord-305054-4d84b2g6 98 13 . . . cord-305054-4d84b2g6 99 1 Currently currently RB cord-305054-4d84b2g6 99 2 , , , cord-305054-4d84b2g6 99 3 very very RB cord-305054-4d84b2g6 99 4 few few JJ cord-305054-4d84b2g6 99 5 samples sample NNS cord-305054-4d84b2g6 99 6 of of IN cord-305054-4d84b2g6 99 7 coronavirus coronavirus NN cord-305054-4d84b2g6 99 8 - - HYPH cord-305054-4d84b2g6 99 9 positive positive JJ cord-305054-4d84b2g6 99 10 pangolins pangolin NNS cord-305054-4d84b2g6 99 11 have have VBP cord-305054-4d84b2g6 99 12 been be VBN cord-305054-4d84b2g6 99 13 sequenced sequence VBN cord-305054-4d84b2g6 99 14 . . . 
cord-305054-4d84b2g6 100 1 Pangolins pangolin NNS cord-305054-4d84b2g6 100 2 - - HYPH cord-305054-4d84b2g6 100 3 CoV cov NN cord-305054-4d84b2g6 100 4 obtained obtain VBN cord-305054-4d84b2g6 100 5 from from IN cord-305054-4d84b2g6 100 6 the the DT cord-305054-4d84b2g6 100 7 Guangdong Guangdong NNP cord-305054-4d84b2g6 100 8 collection collection NN cord-305054-4d84b2g6 100 9 and and CC cord-305054-4d84b2g6 100 10 the the DT cord-305054-4d84b2g6 100 11 Guangxi Guangxi NNP cord-305054-4d84b2g6 100 12 collection collection NN cord-305054-4d84b2g6 100 13 represent represent VBP cord-305054-4d84b2g6 100 14 two two CD cord-305054-4d84b2g6 100 15 lineages lineage NNS cord-305054-4d84b2g6 100 16 of of IN cord-305054-4d84b2g6 100 17 coronavirus coronavirus NN cord-305054-4d84b2g6 100 18 [ [ -LRB- cord-305054-4d84b2g6 100 19 3 3 CD cord-305054-4d84b2g6 100 20 ] ] -RRB- cord-305054-4d84b2g6 100 21 . . . cord-305054-4d84b2g6 101 1 After after IN cord-305054-4d84b2g6 101 2 outbreak outbreak NN cord-305054-4d84b2g6 101 3 of of IN cord-305054-4d84b2g6 101 4 SARS SARS NNP cord-305054-4d84b2g6 101 5 - - HYPH cord-305054-4d84b2g6 101 6 CoV CoV NNP cord-305054-4d84b2g6 101 7 , , , cord-305054-4d84b2g6 101 8 thousands thousand NNS cord-305054-4d84b2g6 101 9 of of IN cord-305054-4d84b2g6 101 10 bat bat NN cord-305054-4d84b2g6 101 11 samples sample NNS cord-305054-4d84b2g6 101 12 were be VBD cord-305054-4d84b2g6 101 13 collected collect VBN cord-305054-4d84b2g6 101 14 and and CC cord-305054-4d84b2g6 101 15 sequenced sequence VBN cord-305054-4d84b2g6 101 16 to to TO cord-305054-4d84b2g6 101 17 identify identify VB cord-305054-4d84b2g6 101 18 coronaviruses coronaviruse NNS cord-305054-4d84b2g6 101 19 that that DT cord-305054-4d84b2g6 101 20 bat bat NN cord-305054-4d84b2g6 101 21 may may MD cord-305054-4d84b2g6 101 22 carry carry VB cord-305054-4d84b2g6 101 23 . . . 
cord-305054-4d84b2g6 102 1 We -PRON- PRP cord-305054-4d84b2g6 102 2 can can MD cord-305054-4d84b2g6 102 3 expect expect VB cord-305054-4d84b2g6 102 4 that that IN cord-305054-4d84b2g6 102 5 in in IN cord-305054-4d84b2g6 102 6 near near JJ cord-305054-4d84b2g6 102 7 future future JJ cord-305054-4d84b2g6 102 8 more more JJR cord-305054-4d84b2g6 102 9 pangolin pangolin NN cord-305054-4d84b2g6 102 10 samples sample NNS cord-305054-4d84b2g6 102 11 will will MD cord-305054-4d84b2g6 102 12 be be VB cord-305054-4d84b2g6 102 13 collected collect VBN cord-305054-4d84b2g6 102 14 and and CC cord-305054-4d84b2g6 102 15 studied study VBN cord-305054-4d84b2g6 102 16 for for IN cord-305054-4d84b2g6 102 17 better well JJR cord-305054-4d84b2g6 102 18 understanding understanding NN cord-305054-4d84b2g6 102 19 of of IN cord-305054-4d84b2g6 102 20 coronavirus coronavirus NN cord-305054-4d84b2g6 102 21 in in IN cord-305054-4d84b2g6 102 22 pangolin pangolin NN cord-305054-4d84b2g6 102 23 . . . cord-305054-4d84b2g6 103 1 In in IN cord-305054-4d84b2g6 103 2 a a DT cord-305054-4d84b2g6 103 3 reference reference NN cord-305054-4d84b2g6 103 4 - - HYPH cord-305054-4d84b2g6 103 5 guided guide VBN cord-305054-4d84b2g6 103 6 assembling assembling NN cord-305054-4d84b2g6 103 7 , , , cord-305054-4d84b2g6 103 8 the the DT cord-305054-4d84b2g6 103 9 resulting result VBG cord-305054-4d84b2g6 103 10 assembly assembly NN cord-305054-4d84b2g6 103 11 may may MD cord-305054-4d84b2g6 103 12 show show VB cord-305054-4d84b2g6 103 13 bias bias NN cord-305054-4d84b2g6 103 14 towards towards IN cord-305054-4d84b2g6 103 15 the the DT cord-305054-4d84b2g6 103 16 reference reference NN cord-305054-4d84b2g6 103 17 genome genome NN cord-305054-4d84b2g6 103 18 [ [ -LRB- cord-305054-4d84b2g6 103 19 7 7 CD cord-305054-4d84b2g6 103 20 ] ] -RRB- cord-305054-4d84b2g6 103 21 . . . 
cord-305054-4d84b2g6 104 1 Successful successful JJ cord-305054-4d84b2g6 104 2 decoding decode VBG cord-305054-4d84b2g6 104 3 a a DT cord-305054-4d84b2g6 104 4 complete complete JJ cord-305054-4d84b2g6 104 5 viral viral JJ cord-305054-4d84b2g6 104 6 genome genome NN cord-305054-4d84b2g6 104 7 usually usually RB cord-305054-4d84b2g6 104 8 require require VBP cord-305054-4d84b2g6 104 9 deep deep JJ cord-305054-4d84b2g6 104 10 sequencing sequencing NN cord-305054-4d84b2g6 104 11 and and CC cord-305054-4d84b2g6 104 12 further further JJ cord-305054-4d84b2g6 104 13 manual manual JJ cord-305054-4d84b2g6 104 14 curations curation NNS cord-305054-4d84b2g6 104 15 to to TO cord-305054-4d84b2g6 104 16 fix fix VB cord-305054-4d84b2g6 104 17 the the DT cord-305054-4d84b2g6 104 18 gaps gap NNS cord-305054-4d84b2g6 104 19 . . . cord-305054-4d84b2g6 105 1 Inaccuracies inaccuracy NNS cord-305054-4d84b2g6 105 2 in in IN cord-305054-4d84b2g6 105 3 assembling assemble VBG cord-305054-4d84b2g6 105 4 the the DT cord-305054-4d84b2g6 105 5 sequencing sequencing NN cord-305054-4d84b2g6 105 6 reads read NNS cord-305054-4d84b2g6 105 7 could could MD cord-305054-4d84b2g6 105 8 mislead mislead VB cord-305054-4d84b2g6 105 9 the the DT cord-305054-4d84b2g6 105 10 subsequent subsequent JJ cord-305054-4d84b2g6 105 11 curation curation NN cord-305054-4d84b2g6 105 12 step step NN cord-305054-4d84b2g6 105 13 . . . 
cord-305054-4d84b2g6 106 1 The the DT cord-305054-4d84b2g6 106 2 whole whole JJ cord-305054-4d84b2g6 106 3 genome genome NN cord-305054-4d84b2g6 106 4 identity identity NN cord-305054-4d84b2g6 106 5 between between IN cord-305054-4d84b2g6 106 6 Bat Bat NNP cord-305054-4d84b2g6 106 7 - - HYPH cord-305054-4d84b2g6 106 8 CoV cov NN cord-305054-4d84b2g6 106 9 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 106 10 and and CC cord-305054-4d84b2g6 106 11 SARS SARS NNP cord-305054-4d84b2g6 106 12 - - HYPH cord-305054-4d84b2g6 106 13 CoV2 CoV2 NNP cord-305054-4d84b2g6 106 14 Wuhan Wuhan NNP cord-305054-4d84b2g6 106 15 - - HYPH cord-305054-4d84b2g6 106 16 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 106 17 is be VBZ cord-305054-4d84b2g6 106 18 about about RB cord-305054-4d84b2g6 106 19 96 96 CD cord-305054-4d84b2g6 106 20 % % NN cord-305054-4d84b2g6 106 21 , , , cord-305054-4d84b2g6 106 22 which which WDT cord-305054-4d84b2g6 106 23 corresponds correspond VBZ cord-305054-4d84b2g6 106 24 to to IN cord-305054-4d84b2g6 106 25 a a DT cord-305054-4d84b2g6 106 26 total total JJ cord-305054-4d84b2g6 106 27 difference difference NN cord-305054-4d84b2g6 106 28 of of IN cord-305054-4d84b2g6 106 29 about about IN cord-305054-4d84b2g6 106 30 1,500 1,500 CD cord-305054-4d84b2g6 106 31 nucleotides nucleotide NNS cord-305054-4d84b2g6 106 32 . . . 
cord-305054-4d84b2g6 107 1 Our -PRON- PRP$ cord-305054-4d84b2g6 107 2 results result NNS cord-305054-4d84b2g6 107 3 have have VBP cord-305054-4d84b2g6 107 4 shown show VBN cord-305054-4d84b2g6 107 5 that that IN cord-305054-4d84b2g6 107 6 observable observable JJ cord-305054-4d84b2g6 107 7 difference difference NN cord-305054-4d84b2g6 107 8 could could MD cord-305054-4d84b2g6 107 9 be be VB cord-305054-4d84b2g6 107 10 found find VBN cord-305054-4d84b2g6 107 11 in in IN cord-305054-4d84b2g6 107 12 the the DT cord-305054-4d84b2g6 107 13 resulting result VBG cord-305054-4d84b2g6 107 14 assemblies assembly NNS cord-305054-4d84b2g6 107 15 when when WRB cord-305054-4d84b2g6 107 16 these these DT cord-305054-4d84b2g6 107 17 genomes genome NNS cord-305054-4d84b2g6 107 18 were be VBD cord-305054-4d84b2g6 107 19 used use VBN cord-305054-4d84b2g6 107 20 as as IN cord-305054-4d84b2g6 107 21 reference reference NN cord-305054-4d84b2g6 107 22 separately separately RB cord-305054-4d84b2g6 107 23 . . . 
cord-305054-4d84b2g6 108 1 Particularly particularly RB cord-305054-4d84b2g6 108 2 , , , cord-305054-4d84b2g6 108 3 when when WRB cord-305054-4d84b2g6 108 4 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 108 5 was be VBD cord-305054-4d84b2g6 108 6 used use VBN cord-305054-4d84b2g6 108 7 as as IN cord-305054-4d84b2g6 108 8 reference reference NN cord-305054-4d84b2g6 108 9 , , , cord-305054-4d84b2g6 108 10 the the DT cord-305054-4d84b2g6 108 11 resulting result VBG cord-305054-4d84b2g6 108 12 assembly assembly NN cord-305054-4d84b2g6 108 13 had have VBD cord-305054-4d84b2g6 108 14 a a DT cord-305054-4d84b2g6 108 15 longer long RBR cord-305054-4d84b2g6 108 16 total total JJ cord-305054-4d84b2g6 108 17 length length NN cord-305054-4d84b2g6 108 18 and and CC cord-305054-4d84b2g6 108 19 higher high JJR cord-305054-4d84b2g6 108 20 N50 N50 NNP cord-305054-4d84b2g6 108 21 value value NN cord-305054-4d84b2g6 108 22 than than IN cord-305054-4d84b2g6 108 23 when when WRB cord-305054-4d84b2g6 108 24 Wuhan Wuhan NNP cord-305054-4d84b2g6 108 25 - - HYPH cord-305054-4d84b2g6 108 26 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 108 27 was be VBD cord-305054-4d84b2g6 108 28 used use VBN cord-305054-4d84b2g6 108 29 . . . 
cord-305054-4d84b2g6 109 1 This this DT cord-305054-4d84b2g6 109 2 points point VBZ cord-305054-4d84b2g6 109 3 to to IN cord-305054-4d84b2g6 109 4 the the DT cord-305054-4d84b2g6 109 5 possibility possibility NN cord-305054-4d84b2g6 109 6 that that IN cord-305054-4d84b2g6 109 7 Pangolin Pangolin NNP cord-305054-4d84b2g6 109 8 - - HYPH cord-305054-4d84b2g6 109 9 CoV CoV NNP cord-305054-4d84b2g6 109 10 is be VBZ cord-305054-4d84b2g6 109 11 more more RBR cord-305054-4d84b2g6 109 12 closely closely RB cord-305054-4d84b2g6 109 13 related related JJ cord-305054-4d84b2g6 109 14 to to IN cord-305054-4d84b2g6 109 15 RaTG13 ratg13 NN cord-305054-4d84b2g6 109 16 than than IN cord-305054-4d84b2g6 109 17 to to IN cord-305054-4d84b2g6 109 18 Wuhan Wuhan NNP cord-305054-4d84b2g6 109 19 - - HYPH cord-305054-4d84b2g6 109 20 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 109 21 . . . cord-305054-4d84b2g6 110 1 Therefore therefore RB cord-305054-4d84b2g6 110 2 , , , cord-305054-4d84b2g6 110 3 in in IN cord-305054-4d84b2g6 110 4 order order NN cord-305054-4d84b2g6 110 5 to to TO cord-305054-4d84b2g6 110 6 decode decode VB cord-305054-4d84b2g6 110 7 the the DT cord-305054-4d84b2g6 110 8 coronavirus coronavirus NN cord-305054-4d84b2g6 110 9 sequence sequence NN cord-305054-4d84b2g6 110 10 accurately accurately RB cord-305054-4d84b2g6 110 11 , , , cord-305054-4d84b2g6 110 12 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 110 13 , , , cord-305054-4d84b2g6 110 14 and and CC cord-305054-4d84b2g6 110 15 possibly possibly RB cord-305054-4d84b2g6 110 16 other other JJ cord-305054-4d84b2g6 110 17 SARS SARS NNP cord-305054-4d84b2g6 110 18 - - HYPH cord-305054-4d84b2g6 110 19 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 110 20 isolates isolate NNS cord-305054-4d84b2g6 110 21 , , , cord-305054-4d84b2g6 110 22 should should MD cord-305054-4d84b2g6 110 23 also also RB cord-305054-4d84b2g6 110 24 be be VB cord-305054-4d84b2g6 110 25 considered consider VBN cord-305054-4d84b2g6 110 26 as as IN cord-305054-4d84b2g6 110 27 
reference reference NN cord-305054-4d84b2g6 110 28 in in IN cord-305054-4d84b2g6 110 29 future future JJ cord-305054-4d84b2g6 110 30 studies study NNS cord-305054-4d84b2g6 110 31 of of IN cord-305054-4d84b2g6 110 32 coronavirus coronavirus NN cord-305054-4d84b2g6 110 33 in in IN cord-305054-4d84b2g6 110 34 Pangolin Pangolin NNP cord-305054-4d84b2g6 110 35 or or CC cord-305054-4d84b2g6 110 36 other other JJ cord-305054-4d84b2g6 110 37 potential potential JJ cord-305054-4d84b2g6 110 38 intermediate intermediate JJ cord-305054-4d84b2g6 110 39 hosts host NNS cord-305054-4d84b2g6 110 40 . . . cord-305054-4d84b2g6 111 1 In in IN cord-305054-4d84b2g6 111 2 addition addition NN cord-305054-4d84b2g6 111 3 to to IN cord-305054-4d84b2g6 111 4 using use VBG cord-305054-4d84b2g6 111 5 one one CD cord-305054-4d84b2g6 111 6 reference reference NN cord-305054-4d84b2g6 111 7 genome genome NN cord-305054-4d84b2g6 111 8 to to TO cord-305054-4d84b2g6 111 9 guide guide VB cord-305054-4d84b2g6 111 10 the the DT cord-305054-4d84b2g6 111 11 assembling assembling NN cord-305054-4d84b2g6 111 12 , , , cord-305054-4d84b2g6 111 13 we -PRON- PRP cord-305054-4d84b2g6 111 14 also also RB cord-305054-4d84b2g6 111 15 attempted attempt VBD cord-305054-4d84b2g6 111 16 to to TO cord-305054-4d84b2g6 111 17 assemble assemble VB cord-305054-4d84b2g6 111 18 all all DT cord-305054-4d84b2g6 111 19 reads read NNS cord-305054-4d84b2g6 111 20 that that WDT cord-305054-4d84b2g6 111 21 mapped map VBD cord-305054-4d84b2g6 111 22 to to IN cord-305054-4d84b2g6 111 23 either either CC cord-305054-4d84b2g6 111 24 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 111 25 or or CC cord-305054-4d84b2g6 111 26 Wuhan Wuhan NNP cord-305054-4d84b2g6 111 27 - - HYPH cord-305054-4d84b2g6 111 28 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 111 29 genomes genome NNS cord-305054-4d84b2g6 111 30 . . . 
cord-305054-4d84b2g6 112 1 The the DT cord-305054-4d84b2g6 112 2 resulting result VBG cord-305054-4d84b2g6 112 3 twogenomes twogenome NNS cord-305054-4d84b2g6 112 4 - - HYPH cord-305054-4d84b2g6 112 5 guided guide VBN cord-305054-4d84b2g6 112 6 assembly assembly NN cord-305054-4d84b2g6 112 7 has have VBZ cord-305054-4d84b2g6 112 8 a a DT cord-305054-4d84b2g6 112 9 total total JJ cord-305054-4d84b2g6 112 10 length length NN cord-305054-4d84b2g6 112 11 of of IN cord-305054-4d84b2g6 112 12 22,707 22,707 CD cord-305054-4d84b2g6 112 13 bp bp NNP cord-305054-4d84b2g6 112 14 , , , cord-305054-4d84b2g6 112 15 which which WDT cord-305054-4d84b2g6 112 16 is be VBZ cord-305054-4d84b2g6 112 17 slightly slightly RB cord-305054-4d84b2g6 112 18 longer long JJR cord-305054-4d84b2g6 112 19 than than IN cord-305054-4d84b2g6 112 20 that that DT cord-305054-4d84b2g6 112 21 of of IN cord-305054-4d84b2g6 112 22 the the DT cord-305054-4d84b2g6 112 23 RaTG13-guided RaTG13-guided NNP cord-305054-4d84b2g6 112 24 assembly assembly NN cord-305054-4d84b2g6 112 25 . . . cord-305054-4d84b2g6 113 1 However however RB cord-305054-4d84b2g6 113 2 , , , cord-305054-4d84b2g6 113 3 the the DT cord-305054-4d84b2g6 113 4 N50 N50 NNP cord-305054-4d84b2g6 113 5 , , , cord-305054-4d84b2g6 113 6 1,388bp 1,388bp CD cord-305054-4d84b2g6 113 7 , , , cord-305054-4d84b2g6 113 8 is be VBZ cord-305054-4d84b2g6 113 9 slightly slightly RB cord-305054-4d84b2g6 113 10 shorter short JJR cord-305054-4d84b2g6 113 11 . . . cord-305054-4d84b2g6 114 1 The the DT cord-305054-4d84b2g6 114 2 authors author NNS cord-305054-4d84b2g6 114 3 have have VBP cord-305054-4d84b2g6 114 4 no no DT cord-305054-4d84b2g6 114 5 conflict conflict NN cord-305054-4d84b2g6 114 6 of of IN cord-305054-4d84b2g6 114 7 interest interest NN cord-305054-4d84b2g6 114 8 . . . cord-305054-4d84b2g6 115 1 Table table NN cord-305054-4d84b2g6 115 2 S1 S1 NNP cord-305054-4d84b2g6 115 3 . . . 
cord-305054-4d84b2g6 116 1 The the DT cord-305054-4d84b2g6 116 2 whole whole JJ cord-305054-4d84b2g6 116 3 genome genome NN cord-305054-4d84b2g6 116 4 nucleotide nucleotide NN cord-305054-4d84b2g6 116 5 similarity similarity NN cord-305054-4d84b2g6 116 6 from from IN cord-305054-4d84b2g6 116 7 RaTG13 RaTG13 NNP cord-305054-4d84b2g6 116 8 , , , cord-305054-4d84b2g6 116 9 Wuhan Wuhan NNP cord-305054-4d84b2g6 116 10 - - HYPH cord-305054-4d84b2g6 116 11 Hu-1 Hu-1 NNP cord-305054-4d84b2g6 116 12 , , , cord-305054-4d84b2g6 116 13 and and CC cord-305054-4d84b2g6 116 14 resulting result VBG cord-305054-4d84b2g6 116 15 assemblies assembly NNS cord-305054-4d84b2g6 116 16 . . . cord-305054-4d84b2g6 117 1 Trimmomatic Trimmomatic NNP cord-305054-4d84b2g6 117 2 : : : cord-305054-4d84b2g6 117 3 a a DT cord-305054-4d84b2g6 117 4 flexible flexible JJ cord-305054-4d84b2g6 117 5 trimmer trimmer NN cord-305054-4d84b2g6 117 6 for for IN cord-305054-4d84b2g6 118 1 Illumina Illumina NNP cord-305054-4d84b2g6 118 2 sequence sequence NN cord-305054-4d84b2g6 118 3 data datum NNS cord-305054-4d84b2g6 119 1 MEGA MEGA NNP cord-305054-4d84b2g6 119 2 X X NNP cord-305054-4d84b2g6 119 3 : : : cord-305054-4d84b2g6 119 4 Molecular Molecular NNP cord-305054-4d84b2g6 119 5 Evolutionary Evolutionary NNP cord-305054-4d84b2g6 119 6 Genetics Genetics NNP cord-305054-4d84b2g6 119 7 Analysis Analysis NNP cord-305054-4d84b2g6 119 8 across across IN cord-305054-4d84b2g6 119 9 Computing Computing NNP cord-305054-4d84b2g6 119 10 Platforms Platforms NNPS cord-305054-4d84b2g6 120 1 Identifying identify VBG cord-305054-4d84b2g6 120 2 SARS SARS NNP cord-305054-4d84b2g6 120 3 - - HYPH cord-305054-4d84b2g6 120 4 CoV-2-related cov-2-relate VBN cord-305054-4d84b2g6 120 5 coronaviruses coronaviruse NNS cord-305054-4d84b2g6 120 6 in in IN cord-305054-4d84b2g6 120 7 Malayan malayan JJ cord-305054-4d84b2g6 120 8 pangolins pangolin NNS cord-305054-4d84b2g6 121 1 MEGAHIT megahit NN cord-305054-4d84b2g6 121 2 : : : 
cord-305054-4d84b2g6 122 1 an an DT cord-305054-4d84b2g6 122 2 ultra ultra JJ cord-305054-4d84b2g6 122 3 - - JJ cord-305054-4d84b2g6 122 4 fast fast JJ cord-305054-4d84b2g6 122 5 single single JJ cord-305054-4d84b2g6 122 6 - - HYPH cord-305054-4d84b2g6 122 7 node node NN cord-305054-4d84b2g6 122 8 solution solution NN cord-305054-4d84b2g6 122 9 for for IN cord-305054-4d84b2g6 122 10 large large JJ cord-305054-4d84b2g6 122 11 and and CC cord-305054-4d84b2g6 122 12 complex complex JJ cord-305054-4d84b2g6 122 13 metagenomics metagenomic NNS cord-305054-4d84b2g6 122 14 assembly assembly NN cord-305054-4d84b2g6 122 15 via via IN cord-305054-4d84b2g6 122 16 succinct succinct JJ cord-305054-4d84b2g6 122 17 de de NNP cord-305054-4d84b2g6 122 18 Bruijn Bruijn NNP cord-305054-4d84b2g6 122 19 graph graph NN cord-305054-4d84b2g6 122 20 Aligning align VBG cord-305054-4d84b2g6 122 21 sequence sequence NN cord-305054-4d84b2g6 122 22 reads read NNS cord-305054-4d84b2g6 122 23 , , , cord-305054-4d84b2g6 122 24 clone clone NN cord-305054-4d84b2g6 122 25 sequences sequence NNS cord-305054-4d84b2g6 122 26 and and CC cord-305054-4d84b2g6 122 27 assembly assembly NN cord-305054-4d84b2g6 122 28 contigs contigs NNP cord-305054-4d84b2g6 122 29 with with IN cord-305054-4d84b2g6 122 30 BWA BWA NNP cord-305054-4d84b2g6 122 31 - - HYPH cord-305054-4d84b2g6 122 32 MEM MEM NNP cord-305054-4d84b2g6 122 33 Fast Fast NNP cord-305054-4d84b2g6 122 34 and and CC cord-305054-4d84b2g6 122 35 accurate accurate JJ cord-305054-4d84b2g6 123 1 short short RB cord-305054-4d84b2g6 123 2 read read VB cord-305054-4d84b2g6 123 3 alignment alignment NN cord-305054-4d84b2g6 123 4 with with IN cord-305054-4d84b2g6 123 5 Burrows Burrows NNP cord-305054-4d84b2g6 123 6 - - HYPH cord-305054-4d84b2g6 123 7 Wheeler Wheeler NNP cord-305054-4d84b2g6 123 8 transform transform VB cord-305054-4d84b2g6 123 9 Reference reference NN cord-305054-4d84b2g6 123 10 - - HYPH cord-305054-4d84b2g6 123 11 guided guide VBN 
cord-305054-4d84b2g6 123 12 de de FW cord-305054-4d84b2g6 123 13 novo novo NNP cord-305054-4d84b2g6 123 14 assembly assembly NNP cord-305054-4d84b2g6 123 15 approach approach NN cord-305054-4d84b2g6 123 16 improves improve VBZ cord-305054-4d84b2g6 123 17 genome genome NN cord-305054-4d84b2g6 123 18 reconstruction reconstruction NN cord-305054-4d84b2g6 123 19 for for IN cord-305054-4d84b2g6 123 20 related relate VBN cord-305054-4d84b2g6 123 21 species specie NNS cord-305054-4d84b2g6 123 22 Viral Viral NNP cord-305054-4d84b2g6 123 23 Metagenomics Metagenomics NNPS cord-305054-4d84b2g6 123 24 Revealed reveal VBD cord-305054-4d84b2g6 123 25 Sendai Sendai NNP cord-305054-4d84b2g6 123 26 Virus Virus NNP cord-305054-4d84b2g6 123 27 and and CC cord-305054-4d84b2g6 123 28 Coronavirus Coronavirus NNP cord-305054-4d84b2g6 123 29 Infection Infection NNP cord-305054-4d84b2g6 123 30 of of IN cord-305054-4d84b2g6 123 31 Malayan malayan JJ cord-305054-4d84b2g6 123 32 Pangolins Pangolins NNPS cord-305054-4d84b2g6 123 33 ( ( -LRB- cord-305054-4d84b2g6 123 34 Manis Manis NNP cord-305054-4d84b2g6 123 35 javanica javanica NNS cord-305054-4d84b2g6 123 36 ) ) -RRB- cord-305054-4d84b2g6 123 37 Are be VBP cord-305054-4d84b2g6 123 38 pangolins pangolin NNS cord-305054-4d84b2g6 124 1 the the DT cord-305054-4d84b2g6 124 2 intermediate intermediate JJ cord-305054-4d84b2g6 124 3 host host NN cord-305054-4d84b2g6 124 4 of of IN cord-305054-4d84b2g6 124 5 the the DT cord-305054-4d84b2g6 124 6 2019 2019 CD cord-305054-4d84b2g6 124 7 novel novel NN cord-305054-4d84b2g6 124 8 coronavirus coronavirus NN cord-305054-4d84b2g6 124 9 ( ( -LRB- cord-305054-4d84b2g6 124 10 SARS SARS NNP cord-305054-4d84b2g6 124 11 - - HYPH cord-305054-4d84b2g6 124 12 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 124 13 ) ) -RRB- cord-305054-4d84b2g6 124 14 ? ? . 
cord-305054-4d84b2g6 125 1 Full full JJ cord-305054-4d84b2g6 125 2 - - HYPH cord-305054-4d84b2g6 125 3 Length length NN cord-305054-4d84b2g6 125 4 Human Human NNP cord-305054-4d84b2g6 125 5 Immunodeficiency Immunodeficiency NNP cord-305054-4d84b2g6 125 6 Virus Virus NNP cord-305054-4d84b2g6 125 7 Type Type NNP cord-305054-4d84b2g6 125 8 1 1 CD cord-305054-4d84b2g6 125 9 Genomes Genomes NNPS cord-305054-4d84b2g6 125 10 from from IN cord-305054-4d84b2g6 125 11 Subtype Subtype NNP cord-305054-4d84b2g6 125 12 C C NNP cord-305054-4d84b2g6 125 13 - - HYPH cord-305054-4d84b2g6 125 14 Infected infect VBN cord-305054-4d84b2g6 125 15 Seroconverters Seroconverters NNP cord-305054-4d84b2g6 125 16 in in IN cord-305054-4d84b2g6 125 17 India India NNP cord-305054-4d84b2g6 125 18 , , , cord-305054-4d84b2g6 125 19 with with IN cord-305054-4d84b2g6 125 20 Evidence evidence NN cord-305054-4d84b2g6 125 21 of of IN cord-305054-4d84b2g6 125 22 Intersubtype Intersubtype NNP cord-305054-4d84b2g6 125 23 Recombination Recombination NNP cord-305054-4d84b2g6 125 24 A a DT cord-305054-4d84b2g6 125 25 new new JJ cord-305054-4d84b2g6 125 26 coronavirus coronavirus NN cord-305054-4d84b2g6 125 27 associated associate VBN cord-305054-4d84b2g6 125 28 with with IN cord-305054-4d84b2g6 125 29 human human JJ cord-305054-4d84b2g6 125 30 respiratory respiratory JJ cord-305054-4d84b2g6 125 31 disease disease NN cord-305054-4d84b2g6 125 32 in in IN cord-305054-4d84b2g6 125 33 China China NNP cord-305054-4d84b2g6 125 34 Isolation Isolation NNP cord-305054-4d84b2g6 125 35 of of IN cord-305054-4d84b2g6 125 36 SARS SARS NNP cord-305054-4d84b2g6 125 37 - - HYPH cord-305054-4d84b2g6 125 38 CoV-2-related cov-2-relate VBN cord-305054-4d84b2g6 125 39 coronavirus coronavirus NN cord-305054-4d84b2g6 125 40 from from IN cord-305054-4d84b2g6 125 41 Malayan malayan JJ cord-305054-4d84b2g6 125 42 pangolins pangolin NNS cord-305054-4d84b2g6 126 1 Probable probable JJ cord-305054-4d84b2g6 126 2 Pangolin Pangolin NNP 
cord-305054-4d84b2g6 126 3 Origin Origin NNP cord-305054-4d84b2g6 126 4 of of IN cord-305054-4d84b2g6 126 5 SARS SARS NNP cord-305054-4d84b2g6 126 6 - - HYPH cord-305054-4d84b2g6 126 7 CoV-2 CoV-2 NNP cord-305054-4d84b2g6 127 1 Associated associate VBN cord-305054-4d84b2g6 127 2 with with IN cord-305054-4d84b2g6 127 3 the the DT cord-305054-4d84b2g6 127 4 COVID-19 COVID-19 NNP cord-305054-4d84b2g6 127 5 Outbreak Outbreak NNP cord-305054-4d84b2g6 127 6 A a DT cord-305054-4d84b2g6 127 7 pneumonia pneumonia NN cord-305054-4d84b2g6 127 8 outbreak outbreak NN cord-305054-4d84b2g6 127 9 associated associate VBN cord-305054-4d84b2g6 127 10 with with IN cord-305054-4d84b2g6 127 11 a a DT cord-305054-4d84b2g6 127 12 new new JJ cord-305054-4d84b2g6 127 13 coronavirus coronavirus NN cord-305054-4d84b2g6 127 14 of of IN cord-305054-4d84b2g6 127 15 probable probable JJ cord-305054-4d84b2g6 127 16 bat bat NN cord-305054-4d84b2g6 127 17 origin origin NN