id sid tid token lemma pos 10_1101-2021_02_13_429885 1 1 A a DT 10_1101-2021_02_13_429885 1 2 fully fully RB 10_1101-2021_02_13_429885 1 3 automated automate VBN 10_1101-2021_02_13_429885 1 4 approach approach NN 10_1101-2021_02_13_429885 1 5 for for IN 10_1101-2021_02_13_429885 1 6 quality quality NN 10_1101-2021_02_13_429885 1 7 control control NN 10_1101-2021_02_13_429885 1 8 of of IN 10_1101-2021_02_13_429885 1 9 cancer cancer NN 10_1101-2021_02_13_429885 1 10 mutations mutation NNS 10_1101-2021_02_13_429885 1 11 in in IN 10_1101-2021_02_13_429885 1 12 the the DT 10_1101-2021_02_13_429885 1 13 era era NN 10_1101-2021_02_13_429885 1 14 of of IN 10_1101-2021_02_13_429885 1 15 high high JJ 10_1101-2021_02_13_429885 1 16 - - HYPH 10_1101-2021_02_13_429885 1 17 resolution resolution NN 10_1101-2021_02_13_429885 1 18 whole whole JJ 10_1101-2021_02_13_429885 1 19 genome genome JJ 10_1101-2021_02_13_429885 1 20 sequencing sequencing NN 10_1101-2021_02_13_429885 1 21 A a DT 10_1101-2021_02_13_429885 1 22 fully fully RB 10_1101-2021_02_13_429885 1 23 automated automate VBN 10_1101-2021_02_13_429885 1 24 approach approach NN 10_1101-2021_02_13_429885 1 25 for for IN 10_1101-2021_02_13_429885 1 26 quality quality NN 10_1101-2021_02_13_429885 1 27 control control NN 10_1101-2021_02_13_429885 1 28 of of IN 10_1101-2021_02_13_429885 1 29 cancer cancer NN 10_1101-2021_02_13_429885 1 30 mutations mutation NNS 10_1101-2021_02_13_429885 1 31 in in IN 10_1101-2021_02_13_429885 1 32 the the DT 10_1101-2021_02_13_429885 1 33 era era NN 10_1101-2021_02_13_429885 1 34 of of IN 10_1101-2021_02_13_429885 1 35 high high JJ 10_1101-2021_02_13_429885 1 36 - - HYPH 10_1101-2021_02_13_429885 1 37 resolution resolution NN 10_1101-2021_02_13_429885 1 38 whole whole JJ 10_1101-2021_02_13_429885 1 39 genome genome JJ 10_1101-2021_02_13_429885 1 40 sequencing sequence VBG 10_1101-2021_02_13_429885 1 41 Jacob Jacob NNP 10_1101-2021_02_13_429885 1 42 Househam Househam NNP 10_1101-2021_02_13_429885 
1 43 , , , 10_1101-2021_02_13_429885 1 44 ​Barts ​bart VBZ 10_1101-2021_02_13_429885 1 45 Cancer Cancer NNP 10_1101-2021_02_13_429885 1 46 Institute Institute NNP 10_1101-2021_02_13_429885 1 47 , , , 10_1101-2021_02_13_429885 1 48 Queen Queen NNP 10_1101-2021_02_13_429885 1 49 Mary Mary NNP 10_1101-2021_02_13_429885 1 50 University University NNP 10_1101-2021_02_13_429885 1 51 of of IN 10_1101-2021_02_13_429885 1 52 London London NNP 10_1101-2021_02_13_429885 1 53 , , , 10_1101-2021_02_13_429885 1 54 UK UK NNP 10_1101-2021_02_13_429885 1 55 William William NNP 10_1101-2021_02_13_429885 1 56 CH CH NNP 10_1101-2021_02_13_429885 1 57 Cross Cross NNP 10_1101-2021_02_13_429885 1 58 , , , 10_1101-2021_02_13_429885 1 59 ​UCL ​UCL NNP 10_1101-2021_02_13_429885 1 60 Cancer Cancer NNP 10_1101-2021_02_13_429885 1 61 Institute Institute NNP 10_1101-2021_02_13_429885 1 62 , , , 10_1101-2021_02_13_429885 1 63 University University NNP 10_1101-2021_02_13_429885 1 64 College College NNP 10_1101-2021_02_13_429885 1 65 London London NNP 10_1101-2021_02_13_429885 1 66 , , , 10_1101-2021_02_13_429885 1 67 UK UK NNP 10_1101-2021_02_13_429885 1 68 ( ( -LRB- 10_1101-2021_02_13_429885 1 69 ★ ★ NNP 10_1101-2021_02_13_429885 1 70 ) ) -RRB- 10_1101-2021_02_13_429885 1 71 Giulio​ Giulio​ NNP 10_1101-2021_02_13_429885 1 72 ​Caravagna ​Caravagna NNP 10_1101-2021_02_13_429885 1 73 , , , 10_1101-2021_02_13_429885 1 74 ​ ​ JJ 10_1101-2021_02_13_429885 1 75 ​Department ​department NN 10_1101-2021_02_13_429885 1 76 of of IN 10_1101-2021_02_13_429885 1 77 Mathematics Mathematics NNP 10_1101-2021_02_13_429885 1 78 and and CC 10_1101-2021_02_13_429885 1 79 Geosciences Geosciences NNP 10_1101-2021_02_13_429885 1 80 , , , 10_1101-2021_02_13_429885 1 81 University University NNP 10_1101-2021_02_13_429885 1 82 of of IN 10_1101-2021_02_13_429885 1 83 Trieste Trieste NNP 10_1101-2021_02_13_429885 1 84 , , , 10_1101-2021_02_13_429885 1 85 Italy Italy NNP 10_1101-2021_02_13_429885 1 86 ( ( -LRB- 
10_1101-2021_02_13_429885 1 87 ★ ★ NNP 10_1101-2021_02_13_429885 1 88 ) ) -RRB- 10_1101-2021_02_13_429885 1 89 Joint joint JJ 10_1101-2021_02_13_429885 1 90 last last JJ 10_1101-2021_02_13_429885 1 91 authors author NNS 10_1101-2021_02_13_429885 1 92 . . . 10_1101-2021_02_13_429885 2 1 ( ( -LRB- 10_1101-2021_02_13_429885 2 2 ★ ★ LS 10_1101-2021_02_13_429885 2 3 ) ) -RRB- 10_1101-2021_02_13_429885 2 4 Corresponding corresponding NN 10_1101-2021_02_13_429885 2 5 : : : 10_1101-2021_02_13_429885 2 6 ​(GC ​(GC NNP 10_1101-2021_02_13_429885 2 7 ) ) -RRB- 10_1101-2021_02_13_429885 2 8 ​gcaravagna@units.it​. ​gcaravagna@units.it​. ADD 10_1101-2021_02_13_429885 3 1 Abstract abstract JJ 10_1101-2021_02_13_429885 3 2 . . . 10_1101-2021_02_13_429885 4 1 ​Cancer ​cancer DT 10_1101-2021_02_13_429885 4 2 is be VBZ 10_1101-2021_02_13_429885 4 3 a a DT 10_1101-2021_02_13_429885 4 4 global global JJ 10_1101-2021_02_13_429885 4 5 health health NN 10_1101-2021_02_13_429885 4 6 issue issue NN 10_1101-2021_02_13_429885 4 7 that that WDT 10_1101-2021_02_13_429885 4 8 places place VBZ 10_1101-2021_02_13_429885 4 9 enormous enormous JJ 10_1101-2021_02_13_429885 4 10 demands demand NNS 10_1101-2021_02_13_429885 4 11 on on IN 10_1101-2021_02_13_429885 4 12 healthcare healthcare NN 10_1101-2021_02_13_429885 4 13 systems system NNS 10_1101-2021_02_13_429885 4 14 . . . 
10_1101-2021_02_13_429885 5 1 Basic basic JJ 10_1101-2021_02_13_429885 5 2 research research NN 10_1101-2021_02_13_429885 5 3 , , , 10_1101-2021_02_13_429885 5 4 the the DT 10_1101-2021_02_13_429885 5 5 development development NN 10_1101-2021_02_13_429885 5 6 of of IN 10_1101-2021_02_13_429885 5 7 targeted target VBN 10_1101-2021_02_13_429885 5 8 treatments treatment NNS 10_1101-2021_02_13_429885 5 9 , , , 10_1101-2021_02_13_429885 5 10 and and CC 10_1101-2021_02_13_429885 5 11 the the DT 10_1101-2021_02_13_429885 5 12 utility utility NN 10_1101-2021_02_13_429885 5 13 of of IN 10_1101-2021_02_13_429885 5 14 DNA DNA NNP 10_1101-2021_02_13_429885 5 15 sequencing sequence VBG 10_1101-2021_02_13_429885 5 16 in in IN 10_1101-2021_02_13_429885 5 17 clinical clinical JJ 10_1101-2021_02_13_429885 5 18 settings setting NNS 10_1101-2021_02_13_429885 5 19 , , , 10_1101-2021_02_13_429885 5 20 have have VBP 10_1101-2021_02_13_429885 5 21 been be VBN 10_1101-2021_02_13_429885 5 22 significantly significantly RB 10_1101-2021_02_13_429885 5 23 improved improve VBN 10_1101-2021_02_13_429885 5 24 with with IN 10_1101-2021_02_13_429885 5 25 the the DT 10_1101-2021_02_13_429885 5 26 introduction introduction NN 10_1101-2021_02_13_429885 5 27 of of IN 10_1101-2021_02_13_429885 5 28 whole whole JJ 10_1101-2021_02_13_429885 5 29 genome genome JJ 10_1101-2021_02_13_429885 5 30 sequencing sequencing NN 10_1101-2021_02_13_429885 5 31 . . . 10_1101-2021_02_13_429885 6 1 However however RB 10_1101-2021_02_13_429885 6 2 the the DT 10_1101-2021_02_13_429885 6 3 broad broad JJ 10_1101-2021_02_13_429885 6 4 applications application NNS 10_1101-2021_02_13_429885 6 5 of of IN 10_1101-2021_02_13_429885 6 6 this this DT 10_1101-2021_02_13_429885 6 7 technology technology NN 10_1101-2021_02_13_429885 6 8 come come VBP 10_1101-2021_02_13_429885 6 9 with with IN 10_1101-2021_02_13_429885 6 10 complications complication NNS 10_1101-2021_02_13_429885 6 11 . . . 
10_1101-2021_02_13_429885 7 1 To to IN 10_1101-2021_02_13_429885 7 2 date date NN 10_1101-2021_02_13_429885 7 3 there there EX 10_1101-2021_02_13_429885 7 4 has have VBZ 10_1101-2021_02_13_429885 7 5 been be VBN 10_1101-2021_02_13_429885 7 6 very very RB 10_1101-2021_02_13_429885 7 7 little little JJ 10_1101-2021_02_13_429885 7 8 standardisation standardisation NN 10_1101-2021_02_13_429885 7 9 in in IN 10_1101-2021_02_13_429885 7 10 how how WRB 10_1101-2021_02_13_429885 7 11 data datum NNS 10_1101-2021_02_13_429885 7 12 quality quality NN 10_1101-2021_02_13_429885 7 13 is be VBZ 10_1101-2021_02_13_429885 7 14 assessed assess VBN 10_1101-2021_02_13_429885 7 15 , , , 10_1101-2021_02_13_429885 7 16 leading lead VBG 10_1101-2021_02_13_429885 7 17 to to IN 10_1101-2021_02_13_429885 7 18 inconsistencies inconsistency NNS 10_1101-2021_02_13_429885 7 19 in in IN 10_1101-2021_02_13_429885 7 20 analyses analysis NNS 10_1101-2021_02_13_429885 7 21 and and CC 10_1101-2021_02_13_429885 7 22 disparate disparate JJ 10_1101-2021_02_13_429885 7 23 conclusions conclusion NNS 10_1101-2021_02_13_429885 7 24 . . . 
10_1101-2021_02_13_429885 8 1 Manual manual JJ 10_1101-2021_02_13_429885 8 2 checking checking NN 10_1101-2021_02_13_429885 8 3 and and CC 10_1101-2021_02_13_429885 8 4 complex complex JJ 10_1101-2021_02_13_429885 8 5 consensus consensus NN 10_1101-2021_02_13_429885 8 6 calling call VBG 10_1101-2021_02_13_429885 8 7 strategies strategy NNS 10_1101-2021_02_13_429885 8 8 often often RB 10_1101-2021_02_13_429885 8 9 do do VBP 10_1101-2021_02_13_429885 8 10 not not RB 10_1101-2021_02_13_429885 8 11 scale scale VB 10_1101-2021_02_13_429885 8 12 to to IN 10_1101-2021_02_13_429885 8 13 large large JJ 10_1101-2021_02_13_429885 8 14 sample sample NN 10_1101-2021_02_13_429885 8 15 numbers number NNS 10_1101-2021_02_13_429885 8 16 , , , 10_1101-2021_02_13_429885 8 17 which which WDT 10_1101-2021_02_13_429885 8 18 leads lead VBZ 10_1101-2021_02_13_429885 8 19 to to IN 10_1101-2021_02_13_429885 8 20 procedural procedural JJ 10_1101-2021_02_13_429885 8 21 bottlenecks bottleneck NNS 10_1101-2021_02_13_429885 8 22 . . . 
10_1101-2021_02_13_429885 9 1 To to TO 10_1101-2021_02_13_429885 9 2 address address VB 10_1101-2021_02_13_429885 9 3 this this DT 10_1101-2021_02_13_429885 9 4 issue issue NN 10_1101-2021_02_13_429885 9 5 , , , 10_1101-2021_02_13_429885 9 6 we -PRON- PRP 10_1101-2021_02_13_429885 9 7 present present VBP 10_1101-2021_02_13_429885 9 8 a a DT 10_1101-2021_02_13_429885 9 9 quality quality NN 10_1101-2021_02_13_429885 9 10 control control NN 10_1101-2021_02_13_429885 9 11 method method NN 10_1101-2021_02_13_429885 9 12 that that WDT 10_1101-2021_02_13_429885 9 13 integrates integrate VBZ 10_1101-2021_02_13_429885 9 14 point point NN 10_1101-2021_02_13_429885 9 15 mutations mutation NNS 10_1101-2021_02_13_429885 9 16 , , , 10_1101-2021_02_13_429885 9 17 copy copy NN 10_1101-2021_02_13_429885 9 18 numbers number NNS 10_1101-2021_02_13_429885 9 19 , , , 10_1101-2021_02_13_429885 9 20 and and CC 10_1101-2021_02_13_429885 9 21 other other JJ 10_1101-2021_02_13_429885 9 22 metrics metric NNS 10_1101-2021_02_13_429885 9 23 into into IN 10_1101-2021_02_13_429885 9 24 a a DT 10_1101-2021_02_13_429885 9 25 single single JJ 10_1101-2021_02_13_429885 9 26 quantitative quantitative JJ 10_1101-2021_02_13_429885 9 27 score score NN 10_1101-2021_02_13_429885 9 28 . . . 
10_1101-2021_02_13_429885 10 1 We -PRON- PRP 10_1101-2021_02_13_429885 10 2 demonstrate demonstrate VBP 10_1101-2021_02_13_429885 10 3 its -PRON- PRP$ 10_1101-2021_02_13_429885 10 4 power power NN 10_1101-2021_02_13_429885 10 5 on on IN 10_1101-2021_02_13_429885 10 6 1,065 1,065 CD 10_1101-2021_02_13_429885 10 7 whole whole JJ 10_1101-2021_02_13_429885 10 8 - - HYPH 10_1101-2021_02_13_429885 10 9 genomes genome NNS 10_1101-2021_02_13_429885 10 10 from from IN 10_1101-2021_02_13_429885 10 11 a a DT 10_1101-2021_02_13_429885 10 12 large large JJ 10_1101-2021_02_13_429885 10 13 - - HYPH 10_1101-2021_02_13_429885 10 14 scale scale NN 10_1101-2021_02_13_429885 10 15 pan pan NN 10_1101-2021_02_13_429885 10 16 - - HYPH 10_1101-2021_02_13_429885 10 17 cancer cancer NN 10_1101-2021_02_13_429885 10 18 cohort cohort NN 10_1101-2021_02_13_429885 10 19 , , , 10_1101-2021_02_13_429885 10 20 and and CC 10_1101-2021_02_13_429885 10 21 on on IN 10_1101-2021_02_13_429885 10 22 multi multi JJ 10_1101-2021_02_13_429885 10 23 - - JJ 10_1101-2021_02_13_429885 10 24 region region JJ 10_1101-2021_02_13_429885 10 25 data datum NNS 10_1101-2021_02_13_429885 10 26 of of IN 10_1101-2021_02_13_429885 10 27 two two CD 10_1101-2021_02_13_429885 10 28 colorectal colorectal JJ 10_1101-2021_02_13_429885 10 29 cancer cancer NN 10_1101-2021_02_13_429885 10 30 patients patient NNS 10_1101-2021_02_13_429885 10 31 . . . 
10_1101-2021_02_13_429885 11 1 We -PRON- PRP 10_1101-2021_02_13_429885 11 2 highlight highlight VBP 10_1101-2021_02_13_429885 11 3 how how WRB 10_1101-2021_02_13_429885 11 4 our -PRON- PRP$ 10_1101-2021_02_13_429885 11 5 approach approach NN 10_1101-2021_02_13_429885 11 6 significantly significantly RB 10_1101-2021_02_13_429885 11 7 improves improve VBZ 10_1101-2021_02_13_429885 11 8 the the DT 10_1101-2021_02_13_429885 11 9 generation generation NN 10_1101-2021_02_13_429885 11 10 of of IN 10_1101-2021_02_13_429885 11 11 cancer cancer NN 10_1101-2021_02_13_429885 11 12 mutation mutation NN 10_1101-2021_02_13_429885 11 13 data datum NNS 10_1101-2021_02_13_429885 11 14 , , , 10_1101-2021_02_13_429885 11 15 providing provide VBG 10_1101-2021_02_13_429885 11 16 visualisations visualisation NNS 10_1101-2021_02_13_429885 11 17 for for IN 10_1101-2021_02_13_429885 11 18 cross cross NN 10_1101-2021_02_13_429885 11 19 - - JJ 10_1101-2021_02_13_429885 11 20 referencing reference VBG 10_1101-2021_02_13_429885 11 21 with with IN 10_1101-2021_02_13_429885 11 22 other other JJ 10_1101-2021_02_13_429885 11 23 analyses analysis NNS 10_1101-2021_02_13_429885 11 24 . . . 
10_1101-2021_02_13_429885 12 1 Our -PRON- PRP$ 10_1101-2021_02_13_429885 12 2 approach approach NN 10_1101-2021_02_13_429885 12 3 is be VBZ 10_1101-2021_02_13_429885 12 4 fully fully RB 10_1101-2021_02_13_429885 12 5 automated automate VBN 10_1101-2021_02_13_429885 12 6 , , , 10_1101-2021_02_13_429885 12 7 designed design VBN 10_1101-2021_02_13_429885 12 8 to to TO 10_1101-2021_02_13_429885 12 9 work work VB 10_1101-2021_02_13_429885 12 10 downstream downstream JJ 10_1101-2021_02_13_429885 12 11 of of IN 10_1101-2021_02_13_429885 12 12 any any DT 10_1101-2021_02_13_429885 12 13 bioinformatic bioinformatic JJ 10_1101-2021_02_13_429885 12 14 pipeline pipeline NN 10_1101-2021_02_13_429885 12 15 , , , 10_1101-2021_02_13_429885 12 16 and and CC 10_1101-2021_02_13_429885 12 17 can can MD 10_1101-2021_02_13_429885 12 18 automatise automatise VB 10_1101-2021_02_13_429885 12 19 tool tool NN 10_1101-2021_02_13_429885 12 20 parameterization parameterization NN 10_1101-2021_02_13_429885 12 21 paving pave VBG 10_1101-2021_02_13_429885 12 22 the the DT 10_1101-2021_02_13_429885 12 23 way way NN 10_1101-2021_02_13_429885 12 24 for for IN 10_1101-2021_02_13_429885 12 25 fast fast JJ 10_1101-2021_02_13_429885 12 26 computational computational JJ 10_1101-2021_02_13_429885 12 27 assessment assessment NN 10_1101-2021_02_13_429885 12 28 of of IN 10_1101-2021_02_13_429885 12 29 data datum NNS 10_1101-2021_02_13_429885 12 30 quality quality NN 10_1101-2021_02_13_429885 12 31 in in IN 10_1101-2021_02_13_429885 12 32 the the DT 10_1101-2021_02_13_429885 12 33 era era NN 10_1101-2021_02_13_429885 12 34 of of IN 10_1101-2021_02_13_429885 12 35 whole whole JJ 10_1101-2021_02_13_429885 12 36 genome genome JJ 10_1101-2021_02_13_429885 12 37 sequencing sequencing NN 10_1101-2021_02_13_429885 12 38 . . . 
10_1101-2021_02_13_429885 13 1 Introduction introduction NN 10_1101-2021_02_13_429885 13 2 Cancer Cancer NNP 10_1101-2021_02_13_429885 13 3 remains remain VBZ 10_1101-2021_02_13_429885 13 4 an an DT 10_1101-2021_02_13_429885 13 5 unsolved unsolved JJ 10_1101-2021_02_13_429885 13 6 problem problem NN 10_1101-2021_02_13_429885 13 7 , , , 10_1101-2021_02_13_429885 13 8 and and CC 10_1101-2021_02_13_429885 13 9 a a DT 10_1101-2021_02_13_429885 13 10 key key JJ 10_1101-2021_02_13_429885 13 11 factor factor NN 10_1101-2021_02_13_429885 13 12 is be VBZ 10_1101-2021_02_13_429885 13 13 that that IN 10_1101-2021_02_13_429885 13 14 tumours tumour NNS 10_1101-2021_02_13_429885 13 15 develop develop VBP 10_1101-2021_02_13_429885 13 16 as as IN 10_1101-2021_02_13_429885 13 17 heterogeneous heterogeneous JJ 10_1101-2021_02_13_429885 13 18 cellular cellular JJ 10_1101-2021_02_13_429885 13 19 populations population NNS 10_1101-2021_02_13_429885 13 20 ​(Greaves ​(Greaves NNPS 10_1101-2021_02_13_429885 13 21 and and CC 10_1101-2021_02_13_429885 13 22 Maley Maley NNP 10_1101-2021_02_13_429885 13 23 2012 2012 CD 10_1101-2021_02_13_429885 13 24 ; ; : 10_1101-2021_02_13_429885 13 25 McGranahan McGranahan NNP 10_1101-2021_02_13_429885 13 26 and and CC 10_1101-2021_02_13_429885 13 27 Swanton Swanton NNP 10_1101-2021_02_13_429885 13 28 2017 2017 CD 10_1101-2021_02_13_429885 13 29 , , , 10_1101-2021_02_13_429885 13 30 2015)​. 2015)​. 
CD 10_1101-2021_02_13_429885 14 1 Cancer cancer NN 10_1101-2021_02_13_429885 14 2 genomes genome NNS 10_1101-2021_02_13_429885 14 3 can can MD 10_1101-2021_02_13_429885 14 4 harbour harbour VB 10_1101-2021_02_13_429885 14 5 multiple multiple JJ 10_1101-2021_02_13_429885 14 6 types type NNS 10_1101-2021_02_13_429885 14 7 of of IN 10_1101-2021_02_13_429885 14 8 mutations mutation NNS 10_1101-2021_02_13_429885 14 9 compared compare VBN 10_1101-2021_02_13_429885 14 10 to to IN 10_1101-2021_02_13_429885 14 11 healthy healthy JJ 10_1101-2021_02_13_429885 14 12 cells cell NNS 10_1101-2021_02_13_429885 14 13 ​(Macintyre ​(Macintyre NNP 10_1101-2021_02_13_429885 14 14 et et FW 10_1101-2021_02_13_429885 14 15 al al NNP 10_1101-2021_02_13_429885 14 16 . . . 10_1101-2021_02_13_429885 15 1 2018 2018 CD 10_1101-2021_02_13_429885 15 2 ; ; : 10_1101-2021_02_13_429885 15 3 Martincorena Martincorena NNP 10_1101-2021_02_13_429885 15 4 et et FW 10_1101-2021_02_13_429885 15 5 al al NNP 10_1101-2021_02_13_429885 15 6 . . . 10_1101-2021_02_13_429885 16 1 2018 2018 CD 10_1101-2021_02_13_429885 16 2 , , , 10_1101-2021_02_13_429885 16 3 2015 2015 CD 10_1101-2021_02_13_429885 16 4 ; ; : 10_1101-2021_02_13_429885 16 5 Nik Nik NNP 10_1101-2021_02_13_429885 16 6 - - HYPH 10_1101-2021_02_13_429885 16 7 Zainal Zainal NNP 10_1101-2021_02_13_429885 16 8 et et NNP 10_1101-2021_02_13_429885 16 9 al al NNP 10_1101-2021_02_13_429885 16 10 . . . 
10_1101-2021_02_13_429885 17 1 2012)​ 2012)​ CD 10_1101-2021_02_13_429885 17 2 , , , 10_1101-2021_02_13_429885 17 3 and and CC 10_1101-2021_02_13_429885 17 4 many many JJ 10_1101-2021_02_13_429885 17 5 of of IN 10_1101-2021_02_13_429885 17 6 these these DT 10_1101-2021_02_13_429885 17 7 events event NNS 10_1101-2021_02_13_429885 17 8 contribute contribute VBP 10_1101-2021_02_13_429885 17 9 to to IN 10_1101-2021_02_13_429885 17 10 the the DT 10_1101-2021_02_13_429885 17 11 pathogenesis pathogenesis NN 10_1101-2021_02_13_429885 17 12 of of IN 10_1101-2021_02_13_429885 17 13 the the DT 10_1101-2021_02_13_429885 17 14 disease disease NN 10_1101-2021_02_13_429885 17 15 , , , 10_1101-2021_02_13_429885 17 16 and and CC 10_1101-2021_02_13_429885 17 17 therapeutic therapeutic JJ 10_1101-2021_02_13_429885 17 18 resistance resistance NN 10_1101-2021_02_13_429885 17 19 . . . 10_1101-2021_02_13_429885 18 1 A a DT 10_1101-2021_02_13_429885 18 2 popular popular JJ 10_1101-2021_02_13_429885 18 3 design design NN 10_1101-2021_02_13_429885 18 4 of of IN 10_1101-2021_02_13_429885 18 5 studies study NNS 10_1101-2021_02_13_429885 18 6 intending intend VBG 10_1101-2021_02_13_429885 18 7 to to IN 10_1101-2021_02_13_429885 18 8 .CC .CC : 10_1101-2021_02_13_429885 18 9 - - : 10_1101-2021_02_13_429885 18 10 BY by IN 10_1101-2021_02_13_429885 18 11 - - HYPH 10_1101-2021_02_13_429885 18 12 NC NC NNP 10_1101-2021_02_13_429885 18 13 - - HYPH 10_1101-2021_02_13_429885 18 14 ND ND NNP 10_1101-2021_02_13_429885 18 15 4.0 4.0 CD 10_1101-2021_02_13_429885 18 16 International International NNP 10_1101-2021_02_13_429885 18 17 licenseavailable licenseavailable NN 10_1101-2021_02_13_429885 18 18 under under IN 10_1101-2021_02_13_429885 18 19 a a DT 10_1101-2021_02_13_429885 18 20 ( ( -LRB- 10_1101-2021_02_13_429885 18 21 which which WDT 10_1101-2021_02_13_429885 18 22 was be VBD 10_1101-2021_02_13_429885 18 23 not not RB 10_1101-2021_02_13_429885 18 24 certified certify VBN 10_1101-2021_02_13_429885 18 
25 by by IN 10_1101-2021_02_13_429885 18 26 peer peer NN 10_1101-2021_02_13_429885 18 27 review review NN 10_1101-2021_02_13_429885 18 28 ) ) -RRB- 10_1101-2021_02_13_429885 18 29 is be VBZ 10_1101-2021_02_13_429885 18 30 the the DT 10_1101-2021_02_13_429885 18 31 author author NN 10_1101-2021_02_13_429885 18 32 / / SYM 10_1101-2021_02_13_429885 18 33 funder funder NN 10_1101-2021_02_13_429885 18 34 , , , 10_1101-2021_02_13_429885 18 35 who who WP 10_1101-2021_02_13_429885 18 36 has have VBZ 10_1101-2021_02_13_429885 18 37 granted grant VBN 10_1101-2021_02_13_429885 18 38 bioRxiv biorxiv IN 10_1101-2021_02_13_429885 18 39 a a DT 10_1101-2021_02_13_429885 18 40 license license NN 10_1101-2021_02_13_429885 18 41 to to TO 10_1101-2021_02_13_429885 18 42 display display VB 10_1101-2021_02_13_429885 18 43 the the DT 10_1101-2021_02_13_429885 18 44 preprint preprint NN 10_1101-2021_02_13_429885 18 45 in in IN 10_1101-2021_02_13_429885 18 46 perpetuity perpetuity NN 10_1101-2021_02_13_429885 18 47 . . . 10_1101-2021_02_13_429885 19 1 It -PRON- PRP 10_1101-2021_02_13_429885 19 2 is be VBZ 10_1101-2021_02_13_429885 19 3 made make VBN 10_1101-2021_02_13_429885 19 4 The the DT 10_1101-2021_02_13_429885 19 5 copyright copyright NN 10_1101-2021_02_13_429885 19 6 holder holder NN 10_1101-2021_02_13_429885 19 7 for for IN 10_1101-2021_02_13_429885 19 8 this this DT 10_1101-2021_02_13_429885 19 9 preprintthis preprintthis NN 10_1101-2021_02_13_429885 19 10 version version NN 10_1101-2021_02_13_429885 19 11 posted post VBD 10_1101-2021_02_13_429885 19 12 February February NNP 10_1101-2021_02_13_429885 19 13 13 13 CD 10_1101-2021_02_13_429885 19 14 , , , 10_1101-2021_02_13_429885 19 15 2021 2021 CD 10_1101-2021_02_13_429885 19 16 . . . 
10_1101-2021_02_13_429885 19 17 ; ; : 10_1101-2021_02_13_429885 19 18 https://doi.org/10.1101/2021.02.13.429885doi https://doi.org/10.1101/2021.02.13.429885doi ADD 10_1101-2021_02_13_429885 19 19 : : : 10_1101-2021_02_13_429885 19 20 bioRxiv biorxiv VB 10_1101-2021_02_13_429885 19 21 preprint preprint NN 10_1101-2021_02_13_429885 19 22 mailto:gcaravagna@units.it mailto:gcaravagna@units.it VBZ 10_1101-2021_02_13_429885 19 23 https://paperpile.com/c/rqVmzs/Pf2t+5LH8+ZoHM https://paperpile.com/c/rqVmzs/Pf2t+5LH8+ZoHM NNP 10_1101-2021_02_13_429885 19 24 https://paperpile.com/c/rqVmzs/Pf2t+5LH8+ZoHM https://paperpile.com/c/rqVmzs/Pf2t+5LH8+ZoHM NNP 10_1101-2021_02_13_429885 19 25 https://paperpile.com/c/rqVmzs/P1Yv+uG2X+4mqr+bHGV https://paperpile.com/c/rqVmzs/P1Yv+uG2X+4mqr+bHGV NNP 10_1101-2021_02_13_429885 19 26 https://paperpile.com/c/rqVmzs/P1Yv+uG2X+4mqr+bHGV https://paperpile.com/c/rqvmzs/p1yv+ug2x+4mqr+bhgv NN 10_1101-2021_02_13_429885 19 27 https://doi.org/10.1101/2021.02.13.429885 https://doi.org/10.1101/2021.02.13.429885 ADD 10_1101-2021_02_13_429885 19 28 http://creativecommons.org/licenses/by-nc-nd/4.0/ http://creativecommons.org/licenses/by-nc-nd/4.0/ CD 10_1101-2021_02_13_429885 19 29 Househam Househam NNP 10_1101-2021_02_13_429885 19 30 et et FW 10_1101-2021_02_13_429885 19 31 al al NNP 10_1101-2021_02_13_429885 19 32 . . . 
10_1101-2021_02_13_429885 20 1 A a DT 10_1101-2021_02_13_429885 20 2 fully fully RB 10_1101-2021_02_13_429885 20 3 automated automate VBN 10_1101-2021_02_13_429885 20 4 approach approach NN 10_1101-2021_02_13_429885 20 5 for for IN 10_1101-2021_02_13_429885 20 6 quality quality NN 10_1101-2021_02_13_429885 20 7 control control NN 10_1101-2021_02_13_429885 20 8 of of IN 10_1101-2021_02_13_429885 20 9 cancer cancer NN 10_1101-2021_02_13_429885 20 10 mutations mutation NNS 10_1101-2021_02_13_429885 20 11 in in IN 10_1101-2021_02_13_429885 20 12 the the DT 10_1101-2021_02_13_429885 20 13 era era NN 10_1101-2021_02_13_429885 20 14 of of IN 10_1101-2021_02_13_429885 20 15 high high JJ 10_1101-2021_02_13_429885 20 16 - - HYPH 10_1101-2021_02_13_429885 20 17 resolution resolution NN 10_1101-2021_02_13_429885 20 18 whole whole JJ 10_1101-2021_02_13_429885 20 19 genome genome JJ 10_1101-2021_02_13_429885 20 20 sequencing sequencing NN 10_1101-2021_02_13_429885 20 21 . . . 10_1101-2021_02_13_429885 21 1 understand understand VB 10_1101-2021_02_13_429885 21 2 tumour tumour NNP 10_1101-2021_02_13_429885 21 3 development development NN 10_1101-2021_02_13_429885 21 4 involves involve VBZ 10_1101-2021_02_13_429885 21 5 collecting collect VBG 10_1101-2021_02_13_429885 21 6 tumour tumour NN 10_1101-2021_02_13_429885 21 7 and and CC 10_1101-2021_02_13_429885 21 8 matched match VBN 10_1101-2021_02_13_429885 21 9 - - HYPH 10_1101-2021_02_13_429885 21 10 normal normal JJ 10_1101-2021_02_13_429885 21 11 biopsies biopsy NNS 10_1101-2021_02_13_429885 21 12 , , , 10_1101-2021_02_13_429885 21 13 and and CC 10_1101-2021_02_13_429885 21 14 generating generate VBG 10_1101-2021_02_13_429885 21 15 so so RB 10_1101-2021_02_13_429885 21 16 - - HYPH 10_1101-2021_02_13_429885 21 17 called call VBN 10_1101-2021_02_13_429885 21 18 “ " `` 10_1101-2021_02_13_429885 21 19 bulk bulk NN 10_1101-2021_02_13_429885 21 20 ” " '' 10_1101-2021_02_13_429885 21 21 DNA dna NN 10_1101-2021_02_13_429885 21 22 
sequencing sequencing NN 10_1101-2021_02_13_429885 21 23 data datum NNS 10_1101-2021_02_13_429885 21 24 for for IN 10_1101-2021_02_13_429885 21 25 both both DT 10_1101-2021_02_13_429885 21 26 ​(Barnell ​(Barnell NNP 10_1101-2021_02_13_429885 21 27 et et FW 10_1101-2021_02_13_429885 21 28 al al NNP 10_1101-2021_02_13_429885 21 29 . . . 10_1101-2021_02_13_429885 22 1 2019)​. 2019)​. CD 10_1101-2021_02_13_429885 23 1 Using use VBG 10_1101-2021_02_13_429885 23 2 bioinformatic bioinformatic JJ 10_1101-2021_02_13_429885 23 3 tools tool NNS 10_1101-2021_02_13_429885 23 4 to to TO 10_1101-2021_02_13_429885 23 5 cross cross VB 10_1101-2021_02_13_429885 23 6 reference reference NN 10_1101-2021_02_13_429885 23 7 the the DT 10_1101-2021_02_13_429885 23 8 normal normal JJ 10_1101-2021_02_13_429885 23 9 genome genome NN 10_1101-2021_02_13_429885 23 10 against against IN 10_1101-2021_02_13_429885 23 11 the the DT 10_1101-2021_02_13_429885 23 12 aberrant aberrant JJ 10_1101-2021_02_13_429885 23 13 one one CD 10_1101-2021_02_13_429885 23 14 , , , 10_1101-2021_02_13_429885 23 15 the the DT 10_1101-2021_02_13_429885 23 16 mutations mutation NNS 10_1101-2021_02_13_429885 23 17 and and CC 10_1101-2021_02_13_429885 23 18 heterogeneity heterogeneity NN 10_1101-2021_02_13_429885 23 19 thereof thereof RB 10_1101-2021_02_13_429885 23 20 found find VBN 10_1101-2021_02_13_429885 23 21 in in IN 10_1101-2021_02_13_429885 23 22 the the DT 10_1101-2021_02_13_429885 23 23 tumour tumour NN 10_1101-2021_02_13_429885 23 24 sample sample NN 10_1101-2021_02_13_429885 23 25 can can MD 10_1101-2021_02_13_429885 23 26 be be VB 10_1101-2021_02_13_429885 23 27 derived derive VBN 10_1101-2021_02_13_429885 23 28 and and CC 10_1101-2021_02_13_429885 23 29 used use VBN 10_1101-2021_02_13_429885 23 30 in in IN 10_1101-2021_02_13_429885 23 31 other other JJ 10_1101-2021_02_13_429885 23 32 analyses analysis NNS 10_1101-2021_02_13_429885 23 33 . . . 
10_1101-2021_02_13_429885 24 1 These these DT 10_1101-2021_02_13_429885 24 2 analyses analysis NNS 10_1101-2021_02_13_429885 24 3 include include VBP 10_1101-2021_02_13_429885 24 4 , , , 10_1101-2021_02_13_429885 24 5 but but CC 10_1101-2021_02_13_429885 24 6 are be VBP 10_1101-2021_02_13_429885 24 7 not not RB 10_1101-2021_02_13_429885 24 8 limited limit VBN 10_1101-2021_02_13_429885 24 9 to to IN 10_1101-2021_02_13_429885 24 10 , , , 10_1101-2021_02_13_429885 24 11 driver driver VB 10_1101-2021_02_13_429885 24 12 mutation mutation NN 10_1101-2021_02_13_429885 24 13 identification identification NN 10_1101-2021_02_13_429885 24 14 ​(Bailey ​(Bailey NNP 10_1101-2021_02_13_429885 24 15 et et FW 10_1101-2021_02_13_429885 24 16 al al NNP 10_1101-2021_02_13_429885 24 17 . . . 10_1101-2021_02_13_429885 25 1 2018 2018 CD 10_1101-2021_02_13_429885 25 2 ; ; : 10_1101-2021_02_13_429885 25 3 Gonzalez Gonzalez NNP 10_1101-2021_02_13_429885 25 4 - - HYPH 10_1101-2021_02_13_429885 25 5 Perez Perez NNP 10_1101-2021_02_13_429885 25 6 et et NNP 10_1101-2021_02_13_429885 25 7 al al NNP 10_1101-2021_02_13_429885 25 8 . . . 
10_1101-2021_02_13_429885 26 1 2013)​ 2013)​ CD 10_1101-2021_02_13_429885 26 2 , , , 10_1101-2021_02_13_429885 26 3 which which WDT 10_1101-2021_02_13_429885 26 4 aims aim VBZ 10_1101-2021_02_13_429885 26 5 to to TO 10_1101-2021_02_13_429885 26 6 discern discern VB 10_1101-2021_02_13_429885 26 7 the the DT 10_1101-2021_02_13_429885 26 8 key key JJ 10_1101-2021_02_13_429885 26 9 aberrations aberration NNS 10_1101-2021_02_13_429885 26 10 that that WDT 10_1101-2021_02_13_429885 26 11 cause cause VBP 10_1101-2021_02_13_429885 26 12 a a DT 10_1101-2021_02_13_429885 26 13 tumour tumour NN 10_1101-2021_02_13_429885 26 14 to to TO 10_1101-2021_02_13_429885 26 15 grow grow VB 10_1101-2021_02_13_429885 26 16 , , , 10_1101-2021_02_13_429885 26 17 patient patient JJ 10_1101-2021_02_13_429885 26 18 clustering clustering NN 10_1101-2021_02_13_429885 26 19 , , , 10_1101-2021_02_13_429885 26 20 which which WDT 10_1101-2021_02_13_429885 26 21 aims aim VBZ 10_1101-2021_02_13_429885 26 22 to to TO 10_1101-2021_02_13_429885 26 23 identify identify VB 10_1101-2021_02_13_429885 26 24 treatment treatment NN 10_1101-2021_02_13_429885 26 25 groups group NNS 10_1101-2021_02_13_429885 26 26 with with IN 10_1101-2021_02_13_429885 26 27 similar similar JJ 10_1101-2021_02_13_429885 26 28 biological biological JJ 10_1101-2021_02_13_429885 26 29 characteristics characteristic NNS 10_1101-2021_02_13_429885 26 30 , , , 10_1101-2021_02_13_429885 26 31 and and CC 10_1101-2021_02_13_429885 26 32 evolutionary evolutionary JJ 10_1101-2021_02_13_429885 26 33 inference inference NN 10_1101-2021_02_13_429885 26 34 ​(Gerstung ​(Gerstung NNP 10_1101-2021_02_13_429885 26 35 et et NNP 10_1101-2021_02_13_429885 26 36 al al NNP 10_1101-2021_02_13_429885 26 37 . . . 
10_1101-2021_02_13_429885 27 1 2020 2020 CD 10_1101-2021_02_13_429885 27 2 ; ; : 10_1101-2021_02_13_429885 27 3 Nik Nik NNP 10_1101-2021_02_13_429885 27 4 - - HYPH 10_1101-2021_02_13_429885 27 5 Zainal Zainal NNP 10_1101-2021_02_13_429885 27 6 et et NNP 10_1101-2021_02_13_429885 27 7 al al NNP 10_1101-2021_02_13_429885 27 8 . . . 10_1101-2021_02_13_429885 28 1 2012 2012 CD 10_1101-2021_02_13_429885 28 2 ; ; : 10_1101-2021_02_13_429885 28 3 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 28 4 et et NNP 10_1101-2021_02_13_429885 28 5 al al NNP 10_1101-2021_02_13_429885 28 6 . . . 10_1101-2021_02_13_429885 29 1 2020) 2020) CD 10_1101-2021_02_13_429885 29 2 , , , 10_1101-2021_02_13_429885 29 3 which which WDT 10_1101-2021_02_13_429885 29 4 informs inform VBZ 10_1101-2021_02_13_429885 29 5 us -PRON- PRP 10_1101-2021_02_13_429885 29 6 how how WRB 10_1101-2021_02_13_429885 29 7 a a DT 10_1101-2021_02_13_429885 29 8 particular particular JJ 10_1101-2021_02_13_429885 29 9 tumour tumour NN 10_1101-2021_02_13_429885 29 10 developed develop VBN 10_1101-2021_02_13_429885 29 11 from from IN 10_1101-2021_02_13_429885 29 12 normal normal JJ 10_1101-2021_02_13_429885 29 13 cells cell NNS 10_1101-2021_02_13_429885 29 14 . . . 
10_1101-2021_02_13_429885 30 1 There there EX 10_1101-2021_02_13_429885 30 2 are be VBP 10_1101-2021_02_13_429885 30 3 several several JJ 10_1101-2021_02_13_429885 30 4 types type NNS 10_1101-2021_02_13_429885 30 5 of of IN 10_1101-2021_02_13_429885 30 6 mutations mutation NNS 10_1101-2021_02_13_429885 30 7 that that WDT 10_1101-2021_02_13_429885 30 8 we -PRON- PRP 10_1101-2021_02_13_429885 30 9 can can MD 10_1101-2021_02_13_429885 30 10 retrieve retrieve VB 10_1101-2021_02_13_429885 30 11 from from IN 10_1101-2021_02_13_429885 30 12 DNA dna NN 10_1101-2021_02_13_429885 30 13 sequencing sequencing NN 10_1101-2021_02_13_429885 30 14 data datum NNS 10_1101-2021_02_13_429885 30 15 ( ( -LRB- 10_1101-2021_02_13_429885 30 16 Campbell Campbell NNP 10_1101-2021_02_13_429885 30 17 et et NNP 10_1101-2021_02_13_429885 30 18 al al NNP 10_1101-2021_02_13_429885 30 19 . . . 10_1101-2021_02_13_429885 31 1 2020). 2020). CD 10_1101-2021_02_13_429885 32 1 Broadly broadly RB 10_1101-2021_02_13_429885 32 2 these these DT 10_1101-2021_02_13_429885 32 3 can can MD 10_1101-2021_02_13_429885 32 4 be be VB 10_1101-2021_02_13_429885 32 5 categorized categorize VBN 10_1101-2021_02_13_429885 32 6 as as IN 10_1101-2021_02_13_429885 32 7 single single JJ 10_1101-2021_02_13_429885 32 8 nucleotide nucleotide JJ 10_1101-2021_02_13_429885 32 9 variants variant NNS 10_1101-2021_02_13_429885 32 10 ( ( -LRB- 10_1101-2021_02_13_429885 32 11 SNVs SNVs NNPS 10_1101-2021_02_13_429885 32 12 ) ) -RRB- 10_1101-2021_02_13_429885 32 13 , , , 10_1101-2021_02_13_429885 32 14 copy copy VBP 10_1101-2021_02_13_429885 32 15 number number NN 10_1101-2021_02_13_429885 32 16 alterations alteration NNS 10_1101-2021_02_13_429885 32 17 ( ( -LRB- 10_1101-2021_02_13_429885 32 18 CNAs CNAs NNP 10_1101-2021_02_13_429885 32 19 ) ) -RRB- 10_1101-2021_02_13_429885 32 20 and and CC 10_1101-2021_02_13_429885 32 21 other other JJ 10_1101-2021_02_13_429885 32 22 more more RBR 10_1101-2021_02_13_429885 32 23 complex complex JJ 
10_1101-2021_02_13_429885 32 24 changes change NNS 10_1101-2021_02_13_429885 32 25 such such JJ 10_1101-2021_02_13_429885 32 26 as as IN 10_1101-2021_02_13_429885 32 27 structural structural JJ 10_1101-2021_02_13_429885 32 28 variants variant NNS 10_1101-2021_02_13_429885 32 29 (Li (Li NNP 10_1101-2021_02_13_429885 32 30 et et FW 10_1101-2021_02_13_429885 32 31 al al NNP 10_1101-2021_02_13_429885 32 32 . . . 10_1101-2021_02_13_429885 33 1 2020). 2020). CD 10_1101-2021_02_13_429885 34 1 All all DT 10_1101-2021_02_13_429885 34 2 types type NNS 10_1101-2021_02_13_429885 34 3 of of IN 10_1101-2021_02_13_429885 34 4 mutations mutation NNS 10_1101-2021_02_13_429885 34 5 can can MD 10_1101-2021_02_13_429885 34 6 drive drive VB 10_1101-2021_02_13_429885 34 7 tumour tumour NN 10_1101-2021_02_13_429885 34 8 progression progression NN 10_1101-2021_02_13_429885 34 9 , , , 10_1101-2021_02_13_429885 34 10 and and CC 10_1101-2021_02_13_429885 34 11 are be VBP 10_1101-2021_02_13_429885 34 12 therefore therefore RB 10_1101-2021_02_13_429885 34 13 important important JJ 10_1101-2021_02_13_429885 34 14 entities entity NNS 10_1101-2021_02_13_429885 34 15 to to TO 10_1101-2021_02_13_429885 34 16 study study VB 10_1101-2021_02_13_429885 34 17 (Kent (Kent NNP 10_1101-2021_02_13_429885 34 18 and and CC 10_1101-2021_02_13_429885 34 19 Green Green NNP 10_1101-2021_02_13_429885 34 20 2017 2017 CD 10_1101-2021_02_13_429885 34 21 - - SYM 10_1101-2021_02_13_429885 34 22 4 4 CD 10_1101-2021_02_13_429885 34 23 ; ; : 10_1101-2021_02_13_429885 34 24 Levine Levine NNP 10_1101-2021_02_13_429885 34 25 , , , 10_1101-2021_02_13_429885 34 26 Jenkins Jenkins NNP 10_1101-2021_02_13_429885 34 27 , , , 10_1101-2021_02_13_429885 34 28 and and CC 10_1101-2021_02_13_429885 34 29 Copeland Copeland NNP 10_1101-2021_02_13_429885 34 30 2019). 2019). 
CD 10_1101-2021_02_13_429885 35 1 Luckily luckily RB 10_1101-2021_02_13_429885 35 2 , , , 10_1101-2021_02_13_429885 35 3 the the DT 10_1101-2021_02_13_429885 35 4 steady steady JJ 10_1101-2021_02_13_429885 35 5 drop drop NN 10_1101-2021_02_13_429885 35 6 in in IN 10_1101-2021_02_13_429885 35 7 sequencing sequence VBG 10_1101-2021_02_13_429885 35 8 costs cost NNS 10_1101-2021_02_13_429885 35 9 is be VBZ 10_1101-2021_02_13_429885 35 10 fueling fuel VBG 10_1101-2021_02_13_429885 35 11 the the DT 10_1101-2021_02_13_429885 35 12 creation creation NN 10_1101-2021_02_13_429885 35 13 of of IN 10_1101-2021_02_13_429885 35 14 large large JJ 10_1101-2021_02_13_429885 35 15 amounts amount NNS 10_1101-2021_02_13_429885 35 16 of of IN 10_1101-2021_02_13_429885 35 17 data datum NNS 10_1101-2021_02_13_429885 35 18 , , , 10_1101-2021_02_13_429885 35 19 which which WDT 10_1101-2021_02_13_429885 35 20 are be VBP 10_1101-2021_02_13_429885 35 21 becoming become VBG 10_1101-2021_02_13_429885 35 22 increasingly increasingly RB 10_1101-2021_02_13_429885 35 23 available available JJ 10_1101-2021_02_13_429885 35 24 for for IN 10_1101-2021_02_13_429885 35 25 researchers researcher NNS 10_1101-2021_02_13_429885 35 26 to to TO 10_1101-2021_02_13_429885 35 27 access access VB 10_1101-2021_02_13_429885 35 28 through through IN 10_1101-2021_02_13_429885 35 29 public public JJ 10_1101-2021_02_13_429885 35 30 databases database NNS 10_1101-2021_02_13_429885 35 31 . . . 
10_1101-2021_02_13_429885 36 1 Notably notably RB 10_1101-2021_02_13_429885 36 2 , , , 10_1101-2021_02_13_429885 36 3 we -PRON- PRP 10_1101-2021_02_13_429885 36 4 are be VBP 10_1101-2021_02_13_429885 36 5 entering enter VBG 10_1101-2021_02_13_429885 36 6 the the DT 10_1101-2021_02_13_429885 36 7 era era NN 10_1101-2021_02_13_429885 36 8 of of IN 10_1101-2021_02_13_429885 36 9 high high JJ 10_1101-2021_02_13_429885 36 10 - - HYPH 10_1101-2021_02_13_429885 36 11 resolution resolution NN 10_1101-2021_02_13_429885 36 12 whole whole RB 10_1101-2021_02_13_429885 36 13 - - HYPH 10_1101-2021_02_13_429885 36 14 genome genome RB 10_1101-2021_02_13_429885 36 15 sequencing sequencing NN 10_1101-2021_02_13_429885 36 16 ( ( -LRB- 10_1101-2021_02_13_429885 36 17 WGS WGS NNP 10_1101-2021_02_13_429885 36 18 ) ) -RRB- 10_1101-2021_02_13_429885 36 19 , , , 10_1101-2021_02_13_429885 36 20 a a DT 10_1101-2021_02_13_429885 36 21 technology technology NN 10_1101-2021_02_13_429885 36 22 that that WDT 10_1101-2021_02_13_429885 36 23 can can MD 10_1101-2021_02_13_429885 36 24 read read VB 10_1101-2021_02_13_429885 36 25 out out RP 10_1101-2021_02_13_429885 36 26 the the DT 10_1101-2021_02_13_429885 36 27 majority majority NN 10_1101-2021_02_13_429885 36 28 of of IN 10_1101-2021_02_13_429885 36 29 a a DT 10_1101-2021_02_13_429885 36 30 tumour tumour NN 10_1101-2021_02_13_429885 36 31 genome genome NN 10_1101-2021_02_13_429885 36 32 , , , 10_1101-2021_02_13_429885 36 33 providing provide VBG 10_1101-2021_02_13_429885 36 34 major major JJ 10_1101-2021_02_13_429885 36 35 improvements improvement NNS 10_1101-2021_02_13_429885 36 36 over over IN 10_1101-2021_02_13_429885 36 37 whole whole JJ 10_1101-2021_02_13_429885 36 38 - - HYPH 10_1101-2021_02_13_429885 36 39 exome exome NN 10_1101-2021_02_13_429885 36 40 counterparts counterpart NNS 10_1101-2021_02_13_429885 36 41 . . . 
10_1101-2021_02_13_429885 37 1 Generating generate VBG 10_1101-2021_02_13_429885 37 2 some some DT 10_1101-2021_02_13_429885 37 3 of of IN 10_1101-2021_02_13_429885 37 4 these these DT 10_1101-2021_02_13_429885 37 5 data datum NNS 10_1101-2021_02_13_429885 37 6 , , , 10_1101-2021_02_13_429885 37 7 however however RB 10_1101-2021_02_13_429885 37 8 , , , 10_1101-2021_02_13_429885 37 9 poses pose VBZ 10_1101-2021_02_13_429885 37 10 challenges challenge NNS 10_1101-2021_02_13_429885 37 11 . . . 10_1101-2021_02_13_429885 38 1 While while IN 10_1101-2021_02_13_429885 38 2 SNVs SNVs NNPS 10_1101-2021_02_13_429885 38 3 are be VBP 10_1101-2021_02_13_429885 38 4 the the DT 10_1101-2021_02_13_429885 38 5 simplest simple JJS 10_1101-2021_02_13_429885 38 6 type type NN 10_1101-2021_02_13_429885 38 7 of of IN 10_1101-2021_02_13_429885 38 8 mutations mutation NNS 10_1101-2021_02_13_429885 38 9 to to TO 10_1101-2021_02_13_429885 38 10 detect detect VB 10_1101-2021_02_13_429885 38 11 using use VBG 10_1101-2021_02_13_429885 38 12 bioinformatic bioinformatic JJ 10_1101-2021_02_13_429885 38 13 analysis analysis NN 10_1101-2021_02_13_429885 38 14 and and CC 10_1101-2021_02_13_429885 38 15 perhaps perhaps RB 10_1101-2021_02_13_429885 38 16 have have VB 10_1101-2021_02_13_429885 38 17 the the DT 10_1101-2021_02_13_429885 38 18 most most RBS 10_1101-2021_02_13_429885 38 19 well well RB 10_1101-2021_02_13_429885 38 20 established establish VBN 10_1101-2021_02_13_429885 38 21 supporting support VBG 10_1101-2021_02_13_429885 38 22 tools tool NNS 10_1101-2021_02_13_429885 38 23 (Li (li CD 10_1101-2021_02_13_429885 38 24 et et FW 10_1101-2021_02_13_429885 38 25 al al NNP 10_1101-2021_02_13_429885 38 26 . . . 
10_1101-2021_02_13_429885 39 1 2020) 2020) CD 10_1101-2021_02_13_429885 39 2 , , , 10_1101-2021_02_13_429885 39 3 CNAs cna NNS 10_1101-2021_02_13_429885 39 4 are be VBP 10_1101-2021_02_13_429885 39 5 particularly particularly RB 10_1101-2021_02_13_429885 39 6 difficult difficult JJ 10_1101-2021_02_13_429885 39 7 to to TO 10_1101-2021_02_13_429885 39 8 call call VB 10_1101-2021_02_13_429885 39 9 since since IN 10_1101-2021_02_13_429885 39 10 the the DT 10_1101-2021_02_13_429885 39 11 baseline baseline NN 10_1101-2021_02_13_429885 39 12 ploidy ploidy NN 10_1101-2021_02_13_429885 39 13 of of IN 10_1101-2021_02_13_429885 39 14 the the DT 10_1101-2021_02_13_429885 39 15 tumour tumour NN 10_1101-2021_02_13_429885 39 16 ( ( -LRB- 10_1101-2021_02_13_429885 39 17 i.e. i.e. FW 10_1101-2021_02_13_429885 39 18 , , , 10_1101-2021_02_13_429885 39 19 the the DT 10_1101-2021_02_13_429885 39 20 number number NN 10_1101-2021_02_13_429885 39 21 of of IN 10_1101-2021_02_13_429885 39 22 chromosome chromosome NN 10_1101-2021_02_13_429885 39 23 copies copy NNS 10_1101-2021_02_13_429885 39 24 ) ) -RRB- 10_1101-2021_02_13_429885 39 25 is be VBZ 10_1101-2021_02_13_429885 39 26 usually usually RB 10_1101-2021_02_13_429885 39 27 unknown unknown JJ 10_1101-2021_02_13_429885 39 28 and and CC 10_1101-2021_02_13_429885 39 29 has have VBZ 10_1101-2021_02_13_429885 39 30 to to TO 10_1101-2021_02_13_429885 39 31 be be VB 10_1101-2021_02_13_429885 39 32 inferred infer VBN 10_1101-2021_02_13_429885 39 33 from from IN 10_1101-2021_02_13_429885 39 34 the the DT 10_1101-2021_02_13_429885 39 35 data datum NNS 10_1101-2021_02_13_429885 39 36 . . . 
10_1101-2021_02_13_429885 40 1 CNAs cna NNS 10_1101-2021_02_13_429885 40 2 are be VBP 10_1101-2021_02_13_429885 40 3 important important JJ 10_1101-2021_02_13_429885 40 4 types type NNS 10_1101-2021_02_13_429885 40 5 of of IN 10_1101-2021_02_13_429885 40 6 cancer cancer NN 10_1101-2021_02_13_429885 40 7 mutations mutation NNS 10_1101-2021_02_13_429885 40 8 ; ; : 10_1101-2021_02_13_429885 40 9 large large JJ 10_1101-2021_02_13_429885 40 10 - - HYPH 10_1101-2021_02_13_429885 40 11 scale scale NN 10_1101-2021_02_13_429885 40 12 gain gain NN 10_1101-2021_02_13_429885 40 13 and and CC 10_1101-2021_02_13_429885 40 14 loss loss NN 10_1101-2021_02_13_429885 40 15 of of IN 10_1101-2021_02_13_429885 40 16 chromosome chromosome NN 10_1101-2021_02_13_429885 40 17 arms arm NNS 10_1101-2021_02_13_429885 40 18 or or CC 10_1101-2021_02_13_429885 40 19 sections section NNS 10_1101-2021_02_13_429885 40 20 of of IN 10_1101-2021_02_13_429885 40 21 arms arm NNS 10_1101-2021_02_13_429885 40 22 can can MD 10_1101-2021_02_13_429885 40 23 confer confer VB 10_1101-2021_02_13_429885 40 24 tumour tumour NN 10_1101-2021_02_13_429885 40 25 cells cell NNS 10_1101-2021_02_13_429885 40 26 with with IN 10_1101-2021_02_13_429885 40 27 large large JJ 10_1101-2021_02_13_429885 40 28 - - HYPH 10_1101-2021_02_13_429885 40 29 scale scale NN 10_1101-2021_02_13_429885 40 30 phenotypic phenotypic NN 10_1101-2021_02_13_429885 40 31 changes change NNS 10_1101-2021_02_13_429885 40 32 , , , 10_1101-2021_02_13_429885 40 33 and and CC 10_1101-2021_02_13_429885 40 34 are be VBP 10_1101-2021_02_13_429885 40 35 often often RB 10_1101-2021_02_13_429885 40 36 important important JJ 10_1101-2021_02_13_429885 40 37 clinical clinical JJ 10_1101-2021_02_13_429885 40 38 targets target NNS 10_1101-2021_02_13_429885 40 39 (Gerstung (Gerstung NNP 10_1101-2021_02_13_429885 40 40 et et FW 10_1101-2021_02_13_429885 40 41 al al NNP 10_1101-2021_02_13_429885 40 42 . . . 
10_1101-2021_02_13_429885 41 1 2020 2020 CD 10_1101-2021_02_13_429885 41 2 ; ; : 10_1101-2021_02_13_429885 41 3 Watkins Watkins NNP 10_1101-2021_02_13_429885 41 4 et et NNP 10_1101-2021_02_13_429885 41 5 al al NNP 10_1101-2021_02_13_429885 41 6 . . . 10_1101-2021_02_13_429885 42 1 11 11 CD 10_1101-2021_02_13_429885 42 2 2020). 2020). CD 10_1101-2021_02_13_429885 43 1 SNVs snv NNS 10_1101-2021_02_13_429885 43 2 and and CC 10_1101-2021_02_13_429885 43 3 CNAs cna NNS 10_1101-2021_02_13_429885 43 4 are be VBP 10_1101-2021_02_13_429885 43 5 intertwined intertwine VBN 10_1101-2021_02_13_429885 43 6 mutation mutation NN 10_1101-2021_02_13_429885 43 7 groups group NNS 10_1101-2021_02_13_429885 43 8 . . . 10_1101-2021_02_13_429885 44 1 They -PRON- PRP 10_1101-2021_02_13_429885 44 2 can can MD 10_1101-2021_02_13_429885 44 3 overlap overlap VB 10_1101-2021_02_13_429885 44 4 within within IN 10_1101-2021_02_13_429885 44 5 a a DT 10_1101-2021_02_13_429885 44 6 tumour tumour NN 10_1101-2021_02_13_429885 44 7 cell cell NN 10_1101-2021_02_13_429885 44 8 ’s ’s POS 10_1101-2021_02_13_429885 44 9 genome genome NN 10_1101-2021_02_13_429885 44 10 , , , 10_1101-2021_02_13_429885 44 11 meaning mean VBG 10_1101-2021_02_13_429885 44 12 the the DT 10_1101-2021_02_13_429885 44 13 number number NN 10_1101-2021_02_13_429885 44 14 of of IN 10_1101-2021_02_13_429885 44 15 copies copy NNS 10_1101-2021_02_13_429885 44 16 of of IN 10_1101-2021_02_13_429885 44 17 an an DT 10_1101-2021_02_13_429885 44 18 SNV SNV NNP 10_1101-2021_02_13_429885 44 19 can can MD 10_1101-2021_02_13_429885 44 20 be be VB 10_1101-2021_02_13_429885 44 21 amplified amplify VBN 10_1101-2021_02_13_429885 44 22 or or CC 10_1101-2021_02_13_429885 44 23 indeed indeed RB 10_1101-2021_02_13_429885 44 24 reduced reduce VBN 10_1101-2021_02_13_429885 44 25 by by IN 10_1101-2021_02_13_429885 44 26 CNAs cna NNS 10_1101-2021_02_13_429885 44 27 . . . 
10_1101-2021_02_13_429885 45 1 This this DT 10_1101-2021_02_13_429885 45 2 depends depend VBZ 10_1101-2021_02_13_429885 45 3 on on IN 10_1101-2021_02_13_429885 45 4 the the DT 10_1101-2021_02_13_429885 45 5 ploidy ploidy NN 10_1101-2021_02_13_429885 45 6 of of IN 10_1101-2021_02_13_429885 45 7 the the DT 10_1101-2021_02_13_429885 45 8 genome genome JJ 10_1101-2021_02_13_429885 45 9 regions region NNS 10_1101-2021_02_13_429885 45 10 overlapping overlap VBG 10_1101-2021_02_13_429885 45 11 with with IN 10_1101-2021_02_13_429885 45 12 the the DT 10_1101-2021_02_13_429885 45 13 variants variant NNS 10_1101-2021_02_13_429885 45 14 . . . 10_1101-2021_02_13_429885 46 1 For for IN 10_1101-2021_02_13_429885 46 2 instance instance NN 10_1101-2021_02_13_429885 46 3 , , , 10_1101-2021_02_13_429885 46 4 for for IN 10_1101-2021_02_13_429885 46 5 a a DT 10_1101-2021_02_13_429885 46 6 clonal clonal JJ 10_1101-2021_02_13_429885 46 7 - - HYPH 10_1101-2021_02_13_429885 46 8 meaning meaning NN 10_1101-2021_02_13_429885 46 9 present present NN 10_1101-2021_02_13_429885 46 10 in in IN 10_1101-2021_02_13_429885 46 11 every every DT 10_1101-2021_02_13_429885 46 12 cell cell NN 10_1101-2021_02_13_429885 46 13 of of IN 10_1101-2021_02_13_429885 46 14 the the DT 10_1101-2021_02_13_429885 46 15 tumour tumour NN 10_1101-2021_02_13_429885 46 16 sample sample NN 10_1101-2021_02_13_429885 46 17 - - HYPH 10_1101-2021_02_13_429885 46 18 heterozygous heterozygous JJ 10_1101-2021_02_13_429885 46 19 SNV SNV NNP 10_1101-2021_02_13_429885 46 20 in in IN 10_1101-2021_02_13_429885 46 21 a a DT 10_1101-2021_02_13_429885 46 22 diploid diploid JJ 10_1101-2021_02_13_429885 46 23 tumour tumour NN 10_1101-2021_02_13_429885 46 24 genome genome NN 10_1101-2021_02_13_429885 46 25 the the DT 10_1101-2021_02_13_429885 46 26 expected expect VBN 10_1101-2021_02_13_429885 46 27 variant variant JJ 10_1101-2021_02_13_429885 46 28 allele allele NNP 10_1101-2021_02_13_429885 46 29 frequency frequency NN 
10_1101-2021_02_13_429885 46 30 ( ( -LRB- 10_1101-2021_02_13_429885 46 31 VAF VAF NNP 10_1101-2021_02_13_429885 46 32 ) ) -RRB- 10_1101-2021_02_13_429885 46 33 is be VBZ 10_1101-2021_02_13_429885 46 34 50 50 CD 10_1101-2021_02_13_429885 46 35 % % NN 10_1101-2021_02_13_429885 46 36 ( ( -LRB- 10_1101-2021_02_13_429885 46 37 i.e. i.e. FW 10_1101-2021_02_13_429885 46 38 , , , 10_1101-2021_02_13_429885 46 39 half half NN 10_1101-2021_02_13_429885 46 40 of of IN 10_1101-2021_02_13_429885 46 41 the the DT 10_1101-2021_02_13_429885 46 42 reads read NNS 10_1101-2021_02_13_429885 46 43 from from IN 10_1101-2021_02_13_429885 46 44 tumour tumour NN 10_1101-2021_02_13_429885 46 45 cells cell NNS 10_1101-2021_02_13_429885 46 46 will will MD 10_1101-2021_02_13_429885 46 47 harbour harbour VB 10_1101-2021_02_13_429885 46 48 the the DT 10_1101-2021_02_13_429885 46 49 SNV SNV NNP 10_1101-2021_02_13_429885 46 50 ) ) -RRB- 10_1101-2021_02_13_429885 46 51 . . . 10_1101-2021_02_13_429885 47 1 Alternatively alternatively RB 10_1101-2021_02_13_429885 47 2 , , , 10_1101-2021_02_13_429885 47 3 if if IN 10_1101-2021_02_13_429885 47 4 each each DT 10_1101-2021_02_13_429885 47 5 chromosome chromosome NN 10_1101-2021_02_13_429885 47 6 is be VBZ 10_1101-2021_02_13_429885 47 7 present present JJ 10_1101-2021_02_13_429885 47 8 in in IN 10_1101-2021_02_13_429885 47 9 three three CD 10_1101-2021_02_13_429885 47 10 copies copy NNS 10_1101-2021_02_13_429885 47 11 ( ( -LRB- 10_1101-2021_02_13_429885 47 12 triploid triploid NNP 10_1101-2021_02_13_429885 47 13 ) ) -RRB- 10_1101-2021_02_13_429885 47 14 , , , 10_1101-2021_02_13_429885 47 15 the the DT 10_1101-2021_02_13_429885 47 16 expected expect VBN 10_1101-2021_02_13_429885 47 17 VAF VAF NNP 10_1101-2021_02_13_429885 47 18 is be VBZ 10_1101-2021_02_13_429885 47 19 33 33 CD 10_1101-2021_02_13_429885 47 20 % % NN 10_1101-2021_02_13_429885 47 21 - - , 10_1101-2021_02_13_429885 47 22 if if IN 10_1101-2021_02_13_429885 47 23 the the DT 
10_1101-2021_02_13_429885 47 24 SNV SNV NNP 10_1101-2021_02_13_429885 47 25 occurred occur VBD 10_1101-2021_02_13_429885 47 26 after after IN 10_1101-2021_02_13_429885 47 27 the the DT 10_1101-2021_02_13_429885 47 28 amplification amplification NN 10_1101-2021_02_13_429885 47 29 - - , 10_1101-2021_02_13_429885 47 30 or or CC 10_1101-2021_02_13_429885 47 31 66 66 CD 10_1101-2021_02_13_429885 47 32 % % NN 10_1101-2021_02_13_429885 47 33 - - , 10_1101-2021_02_13_429885 47 34 if if IN 10_1101-2021_02_13_429885 47 35 the the DT 10_1101-2021_02_13_429885 47 36 SNV SNV NNP 10_1101-2021_02_13_429885 47 37 is be VBZ 10_1101-2021_02_13_429885 47 38 on on IN 10_1101-2021_02_13_429885 47 39 the the DT 10_1101-2021_02_13_429885 47 40 amplified amplify VBN 10_1101-2021_02_13_429885 47 41 chromosome chromosome NN 10_1101-2021_02_13_429885 47 42 and and CC 10_1101-2021_02_13_429885 47 43 occurred occur VBD 10_1101-2021_02_13_429885 47 44 before before IN 10_1101-2021_02_13_429885 47 45 the the DT 10_1101-2021_02_13_429885 47 46 amplification amplification NN 10_1101-2021_02_13_429885 47 47 . . . 
10_1101-2021_02_13_429885 48 1 The the DT 10_1101-2021_02_13_429885 48 2 theoretical theoretical JJ 
10_1101-2021_02_13_429885 51 1 frequencies frequency NNS 10_1101-2021_02_13_429885 51 2 are be VBP 10_1101-2021_02_13_429885 51 3 observed observe VBN 10_1101-2021_02_13_429885 51 4 with with IN 10_1101-2021_02_13_429885 51 5 a a DT 10_1101-2021_02_13_429885 51 6 Binomial Binomial NNP 10_1101-2021_02_13_429885 51 7 noise noise NN 10_1101-2021_02_13_429885 51 8 model model NN 10_1101-2021_02_13_429885 51 9 that that WDT 10_1101-2021_02_13_429885 51 10 depends depend VBZ 10_1101-2021_02_13_429885 51 11 on on IN 10_1101-2021_02_13_429885 51 12 the the DT 10_1101-2021_02_13_429885 51 13 depth depth NN 10_1101-2021_02_13_429885 51 14 of of IN 10_1101-2021_02_13_429885 51 15 sequencing sequencing NN 10_1101-2021_02_13_429885 51 16 and and CC 10_1101-2021_02_13_429885 51 17 the the DT 10_1101-2021_02_13_429885 51 18 actual actual JJ 10_1101-2021_02_13_429885 51 19 VAF VAF NNP 10_1101-2021_02_13_429885 51 20 (Nik (nik CD 10_1101-2021_02_13_429885 51 21 - - HYPH 10_1101-2021_02_13_429885 51 22 Zainal Zainal NNP 10_1101-2021_02_13_429885 51 23 et et NNP 10_1101-2021_02_13_429885 51 24 al al NNP 10_1101-2021_02_13_429885 51 25 . . . 10_1101-2021_02_13_429885 52 1 2012 2012 CD 10_1101-2021_02_13_429885 52 2 ; ; : 10_1101-2021_02_13_429885 52 3 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 52 4 et et NNP 10_1101-2021_02_13_429885 52 5 al al NNP 10_1101-2021_02_13_429885 52 6 . . . 10_1101-2021_02_13_429885 53 1 2020). 2020). 
CD 10_1101-2021_02_13_429885 54 1 We -PRON- PRP 10_1101-2021_02_13_429885 54 2 note note VBP 10_1101-2021_02_13_429885 54 3 that that IN 10_1101-2021_02_13_429885 54 4 these these DT 10_1101-2021_02_13_429885 54 5 VAFs vaf NNS 10_1101-2021_02_13_429885 54 6 hold hold VBP 10_1101-2021_02_13_429885 54 7 for for IN 10_1101-2021_02_13_429885 54 8 pure pure JJ 10_1101-2021_02_13_429885 54 9 bulk bulk JJ 10_1101-2021_02_13_429885 54 10 tumour tumour NN 10_1101-2021_02_13_429885 54 11 samples sample NNS 10_1101-2021_02_13_429885 54 12 ( ( -LRB- 10_1101-2021_02_13_429885 54 13 100 100 CD 10_1101-2021_02_13_429885 54 14 % % NN 10_1101-2021_02_13_429885 54 15 tumour tumour NN 10_1101-2021_02_13_429885 54 16 cells cell NNS 10_1101-2021_02_13_429885 54 17 ) ) -RRB- 10_1101-2021_02_13_429885 54 18 . . . 10_1101-2021_02_13_429885 55 1 Realistically realistically RB 10_1101-2021_02_13_429885 55 2 , , , 10_1101-2021_02_13_429885 55 3 most most JJS 10_1101-2021_02_13_429885 55 4 bulk bulk JJ 10_1101-2021_02_13_429885 55 5 samples sample NNS 10_1101-2021_02_13_429885 55 6 contain contain VBP 10_1101-2021_02_13_429885 55 7 normal normal JJ 10_1101-2021_02_13_429885 55 8 cells cell NNS 10_1101-2021_02_13_429885 55 9 , , , 10_1101-2021_02_13_429885 55 10 the the DT 10_1101-2021_02_13_429885 55 11 percentage percentage NN 10_1101-2021_02_13_429885 55 12 of of IN 10_1101-2021_02_13_429885 55 13 which which WDT 10_1101-2021_02_13_429885 55 14 shifts shift VBZ 10_1101-2021_02_13_429885 55 15 these these DT 10_1101-2021_02_13_429885 55 16 theoretical theoretical JJ 10_1101-2021_02_13_429885 55 17 frequencies frequency NNS 10_1101-2021_02_13_429885 55 18 towards towards IN 10_1101-2021_02_13_429885 55 19 lower low JJR 10_1101-2021_02_13_429885 55 20 values value NNS 10_1101-2021_02_13_429885 55 21 . . . 
10_1101-2021_02_13_429885 56 1 These these DT 10_1101-2021_02_13_429885 56 2 ideas idea NNS 10_1101-2021_02_13_429885 56 3 are be VBP 10_1101-2021_02_13_429885 56 4 leveraged leverage VBN 10_1101-2021_02_13_429885 56 5 by by IN 10_1101-2021_02_13_429885 56 6 methods method NNS 10_1101-2021_02_13_429885 56 7 that that WDT 10_1101-2021_02_13_429885 56 8 seek seek VBP 10_1101-2021_02_13_429885 56 9 to to TO 10_1101-2021_02_13_429885 56 10 compute compute VB 10_1101-2021_02_13_429885 56 11 the the DT 10_1101-2021_02_13_429885 56 12 Cancer Cancer NNP 10_1101-2021_02_13_429885 56 13 Cell Cell NNP 10_1101-2021_02_13_429885 56 14 Fractions Fractions NNPS 10_1101-2021_02_13_429885 56 15 ( ( -LRB- 10_1101-2021_02_13_429885 56 16 CCFs CCFs NNP 10_1101-2021_02_13_429885 56 17 ) ) -RRB- 10_1101-2021_02_13_429885 56 18 of of IN 10_1101-2021_02_13_429885 56 19 the the DT 10_1101-2021_02_13_429885 56 20 tumour tumour NN 10_1101-2021_02_13_429885 56 21 , , , 10_1101-2021_02_13_429885 56 22 i.e. i.e. FW 10_1101-2021_02_13_429885 56 23 , , , 10_1101-2021_02_13_429885 56 24 a a DT 10_1101-2021_02_13_429885 56 25 normalisation normalisation NN 10_1101-2021_02_13_429885 56 26 of of IN 10_1101-2021_02_13_429885 56 27 the the DT 10_1101-2021_02_13_429885 56 28 observed observed JJ 10_1101-2021_02_13_429885 56 29 tumour tumour NN 10_1101-2021_02_13_429885 56 30 VAF VAF NNP 10_1101-2021_02_13_429885 56 31 for for IN 10_1101-2021_02_13_429885 56 32 the the DT 10_1101-2021_02_13_429885 56 33 CNA CNA NNP 10_1101-2021_02_13_429885 56 34 , , , 10_1101-2021_02_13_429885 56 35 the the DT 10_1101-2021_02_13_429885 56 36 number number NN 10_1101-2021_02_13_429885 56 37 of of IN 10_1101-2021_02_13_429885 56 38 copies copy NNS 10_1101-2021_02_13_429885 56 39 of of IN 10_1101-2021_02_13_429885 56 40 a a DT 10_1101-2021_02_13_429885 56 41 mutation mutation NN 10_1101-2021_02_13_429885 56 42 ( ( -LRB- 10_1101-2021_02_13_429885 56 43 mutation mutation NN 10_1101-2021_02_13_429885 56 44 multiplicity 
multiplicity NN 10_1101-2021_02_13_429885 56 45 ) ) -RRB- 10_1101-2021_02_13_429885 56 46 and and CC 10_1101-2021_02_13_429885 56 47 tumour tumour VB 10_1101-2021_02_13_429885 56 48 purity purity NN 10_1101-2021_02_13_429885 56 49 (Nik (Nik NNP 10_1101-2021_02_13_429885 56 50 - - HYPH 10_1101-2021_02_13_429885 56 51 Zainal Zainal NNP 10_1101-2021_02_13_429885 56 52 et et NNP 10_1101-2021_02_13_429885 56 53 al al NNP 10_1101-2021_02_13_429885 56 54 . . . 10_1101-2021_02_13_429885 57 1 2012). 2012). XX 10_1101-2021_02_13_429885 58 1 Many many JJ 10_1101-2021_02_13_429885 58 2 bioinformatics bioinformatic NNS 10_1101-2021_02_13_429885 58 3 pipelines pipeline NNS 10_1101-2021_02_13_429885 58 4 are be VBP 10_1101-2021_02_13_429885 58 5 designed design VBN 10_1101-2021_02_13_429885 58 6 to to TO 10_1101-2021_02_13_429885 58 7 start start VB 10_1101-2021_02_13_429885 58 8 from from IN 10_1101-2021_02_13_429885 58 9 a a DT 10_1101-2021_02_13_429885 58 10 BAM bam NN 10_1101-2021_02_13_429885 58 11 formatted format VBN 10_1101-2021_02_13_429885 58 12 input input NN 10_1101-2021_02_13_429885 58 13 file file NN 10_1101-2021_02_13_429885 58 14 and and CC 10_1101-2021_02_13_429885 58 15 , , , 10_1101-2021_02_13_429885 58 16 following follow VBG 10_1101-2021_02_13_429885 58 17 variant variant JJ 10_1101-2021_02_13_429885 58 18 calling calling NN 10_1101-2021_02_13_429885 58 19 , , , 10_1101-2021_02_13_429885 58 20 extract extract VB 10_1101-2021_02_13_429885 58 21 the the DT 10_1101-2021_02_13_429885 58 22 VAF VAF NNP 10_1101-2021_02_13_429885 58 23 of of IN 10_1101-2021_02_13_429885 58 24 mutations mutation NNS 10_1101-2021_02_13_429885 58 25 while while IN 10_1101-2021_02_13_429885 58 26 calling call VBG 10_1101-2021_02_13_429885 58 27 CNAs cna NNS 10_1101-2021_02_13_429885 58 28 in in IN 10_1101-2021_02_13_429885 58 29 parallel parallel NN 10_1101-2021_02_13_429885 58 30 ( ( -LRB- 10_1101-2021_02_13_429885 58 31 Boeva Boeva NNP 10_1101-2021_02_13_429885 58 32 et et NNP 
10_1101-2021_02_13_429885 58 33 al al NNP 10_1101-2021_02_13_429885 58 34 . . . 10_1101-2021_02_13_429885 59 1 2011 2011 CD 10_1101-2021_02_13_429885 59 2 ; ; : 10_1101-2021_02_13_429885 59 3 Cmero Cmero NNP 10_1101-2021_02_13_429885 59 4 et et NNP 10_1101-2021_02_13_429885 59 5 al al NNP 10_1101-2021_02_13_429885 59 6 . . . 10_1101-2021_02_13_429885 60 1 2020 2020 CD 10_1101-2021_02_13_429885 60 2 ; ; : 10_1101-2021_02_13_429885 60 3 Zaccaria Zaccaria NNP 10_1101-2021_02_13_429885 60 4 and and CC 10_1101-2021_02_13_429885 60 5 Raphael Raphael NNP 10_1101-2021_02_13_429885 60 6 2020 2020 CD 10_1101-2021_02_13_429885 60 7 ; ; : 10_1101-2021_02_13_429885 60 8 Van Van NNP 10_1101-2021_02_13_429885 60 9 Loo Loo NNP 10_1101-2021_02_13_429885 60 10 et et FW 10_1101-2021_02_13_429885 60 11 al al NNP 10_1101-2021_02_13_429885 60 12 . . . 10_1101-2021_02_13_429885 61 1 2010). 2010). CD 10_1101-2021_02_13_429885 62 1 These these DT 10_1101-2021_02_13_429885 62 2 analyses analysis NNS 10_1101-2021_02_13_429885 62 3 are be VBP 10_1101-2021_02_13_429885 62 4 nearly nearly RB 10_1101-2021_02_13_429885 62 5 always always RB 10_1101-2021_02_13_429885 62 6 decoupled decouple VBN 10_1101-2021_02_13_429885 62 7 , , , 10_1101-2021_02_13_429885 62 8 and and CC 10_1101-2021_02_13_429885 62 9 can can MD 10_1101-2021_02_13_429885 62 10 return return VB 10_1101-2021_02_13_429885 62 11 inconsistent inconsistent JJ 10_1101-2021_02_13_429885 62 12 variant variant JJ 10_1101-2021_02_13_429885 62 13 calls call NNS 10_1101-2021_02_13_429885 62 14 ; ; , 10_1101-2021_02_13_429885 62 15 i.e. i.e. 
FW 10_1101-2021_02_13_429885 62 16 , , , 10_1101-2021_02_13_429885 62 17 CNAs cna NNS 10_1101-2021_02_13_429885 62 18 and and CC 10_1101-2021_02_13_429885 62 19 purity purity NN 10_1101-2021_02_13_429885 62 20 that that WDT 10_1101-2021_02_13_429885 62 21 mismatch mismatch VBP 10_1101-2021_02_13_429885 62 22 the the DT 10_1101-2021_02_13_429885 62 23 empirical empirical JJ 10_1101-2021_02_13_429885 62 24 VAF VAF NNP 10_1101-2021_02_13_429885 62 25 from from IN 10_1101-2021_02_13_429885 62 26 the the DT 10_1101-2021_02_13_429885 62 27 BAMs bam NNS 10_1101-2021_02_13_429885 62 28 . . . 10_1101-2021_02_13_429885 63 1 Since since IN 10_1101-2021_02_13_429885 63 2 CNAs cna NNS 10_1101-2021_02_13_429885 63 3 and and CC 10_1101-2021_02_13_429885 63 4 purity purity NN 10_1101-2021_02_13_429885 63 5 are be VBP 10_1101-2021_02_13_429885 63 6 inferred infer VBN 10_1101-2021_02_13_429885 63 7 through through IN 10_1101-2021_02_13_429885 63 8 various various JJ 10_1101-2021_02_13_429885 63 9 measurements measurement NNS 10_1101-2021_02_13_429885 63 10 that that WDT 10_1101-2021_02_13_429885 63 11 are be VBP 10_1101-2021_02_13_429885 63 12 subject subject JJ 10_1101-2021_02_13_429885 63 13 to to IN 10_1101-2021_02_13_429885 63 14 noise noise NN 10_1101-2021_02_13_429885 63 15 - - HYPH 10_1101-2021_02_13_429885 63 16 i.e. i.e. 
FW 10_1101-2021_02_13_429885 63 17 , , , 10_1101-2021_02_13_429885 63 18 mutation mutation NN 10_1101-2021_02_13_429885 63 19 allele allele NNP 10_1101-2021_02_13_429885 63 20 ratios ratio NNS 10_1101-2021_02_13_429885 63 21 , , , 10_1101-2021_02_13_429885 63 22 tumour tumour NN 10_1101-2021_02_13_429885 63 23 - - HYPH 10_1101-2021_02_13_429885 63 24 normal normal JJ 10_1101-2021_02_13_429885 63 25 depth depth NN 10_1101-2021_02_13_429885 63 26 ratios ratio NNS 10_1101-2021_02_13_429885 63 27 and and CC 10_1101-2021_02_13_429885 63 28 B b NN 10_1101-2021_02_13_429885 63 29 - - HYPH 10_1101-2021_02_13_429885 63 30 allele allele NNP 10_1101-2021_02_13_429885 63 31 frequencies frequency NNS 10_1101-2021_02_13_429885 63 32 are be VBP 10_1101-2021_02_13_429885 63 33 prime prime JJ 10_1101-2021_02_13_429885 63 34 examples example NNS 10_1101-2021_02_13_429885 63 35 - - : 10_1101-2021_02_13_429885 63 36 they -PRON- PRP 10_1101-2021_02_13_429885 63 37 are be VBP 10_1101-2021_02_13_429885 63 38 the the DT 10_1101-2021_02_13_429885 63 39 most most RBS 10_1101-2021_02_13_429885 63 40 likely likely JJ 10_1101-2021_02_13_429885 63 41 cause cause NN 10_1101-2021_02_13_429885 63 42 of of IN 10_1101-2021_02_13_429885 63 43 error error NN 10_1101-2021_02_13_429885 63 44 . . . 
10_1101-2021_02_13_429885 64 1 While while IN 10_1101-2021_02_13_429885 64 2 in in IN 10_1101-2021_02_13_429885 64 3 some some DT 10_1101-2021_02_13_429885 64 4 cases case NNS 10_1101-2021_02_13_429885 64 5 these these DT 10_1101-2021_02_13_429885 64 6 errors error NNS 10_1101-2021_02_13_429885 64 7 can can MD 10_1101-2021_02_13_429885 64 8 be be VB 10_1101-2021_02_13_429885 64 9 spotted spot VBN 10_1101-2021_02_13_429885 64 10 and and CC 10_1101-2021_02_13_429885 64 11 fixed fix VBN 10_1101-2021_02_13_429885 64 12 by by IN 10_1101-2021_02_13_429885 64 13 manual manual JJ 10_1101-2021_02_13_429885 64 14 intervention intervention NN 10_1101-2021_02_13_429885 64 15 , , , 10_1101-2021_02_13_429885 64 16 this this DT 10_1101-2021_02_13_429885 64 17 process process NN 10_1101-2021_02_13_429885 64 18 is be VBZ 10_1101-2021_02_13_429885 64 19 also also RB 10_1101-2021_02_13_429885 64 20 subject subject JJ 10_1101-2021_02_13_429885 64 21 to to IN 10_1101-2021_02_13_429885 64 22 inconsistencies inconsistency NNS 10_1101-2021_02_13_429885 64 23 in in IN 10_1101-2021_02_13_429885 64 24 the the DT 10_1101-2021_02_13_429885 64 25 absence absence NN 10_1101-2021_02_13_429885 64 26 of of IN 10_1101-2021_02_13_429885 64 27 a a DT 10_1101-2021_02_13_429885 64 28 proper proper JJ 10_1101-2021_02_13_429885 64 29 statistical statistical JJ 10_1101-2021_02_13_429885 64 30 framework framework NN 10_1101-2021_02_13_429885 64 31 , , , 10_1101-2021_02_13_429885 64 32 and and CC 10_1101-2021_02_13_429885 64 33 does do VBZ 10_1101-2021_02_13_429885 64 34 not not RB 10_1101-2021_02_13_429885 64 35 scale scale VB 10_1101-2021_02_13_429885 64 36 in in IN 10_1101-2021_02_13_429885 64 37 studies study NNS 10_1101-2021_02_13_429885 64 38 seeking seek VBG 10_1101-2021_02_13_429885 64 39 to to TO 10_1101-2021_02_13_429885 64 40 generate generate VB 10_1101-2021_02_13_429885 64 41 datasets dataset NNS 10_1101-2021_02_13_429885 64 42 with with IN 10_1101-2021_02_13_429885 64 43 millions million NNS 
10_1101-2021_02_13_429885 64 44 of of IN 10_1101-2021_02_13_429885 64 45 data datum NNS 10_1101-2021_02_13_429885 64 46 points point NNS 10_1101-2021_02_13_429885 64 47 ​(Campbell ​(Campbell NNP 10_1101-2021_02_13_429885 64 48 et et FW 10_1101-2021_02_13_429885 64 49 al al NNP 10_1101-2021_02_13_429885 64 50 . . . 10_1101-2021_02_13_429885 65 1 2020 2020 CD 10_1101-2021_02_13_429885 65 2 ; ; : 10_1101-2021_02_13_429885 65 3 Priestley Priestley NNP 10_1101-2021_02_13_429885 65 4 et et NNP 10_1101-2021_02_13_429885 65 5 al al NNP 10_1101-2021_02_13_429885 65 6 . . . 10_1101-2021_02_13_429885 66 1 2019 2019 CD 10_1101-2021_02_13_429885 66 2 ; ; : 10_1101-2021_02_13_429885 66 3 Turnbull Turnbull NNP 10_1101-2021_02_13_429885 66 4 et et FW 10_1101-2021_02_13_429885 66 5 al al NNP 10_1101-2021_02_13_429885 66 6 . . . 10_1101-2021_02_13_429885 67 1 2018)​. 2018)​. CD 10_1101-2021_02_13_429885 68 1 The the DT 10_1101-2021_02_13_429885 68 2 intrinsic intrinsic JJ 10_1101-2021_02_13_429885 68 3 performance performance NN 10_1101-2021_02_13_429885 68 4 of of IN 10_1101-2021_02_13_429885 68 5 a a DT 10_1101-2021_02_13_429885 68 6 variant variant JJ 10_1101-2021_02_13_429885 68 7 caller caller NN 10_1101-2021_02_13_429885 68 8 and and CC 10_1101-2021_02_13_429885 68 9 sequencing sequencing NN 10_1101-2021_02_13_429885 68 10 noise noise NN 10_1101-2021_02_13_429885 68 11 therefore therefore RB 10_1101-2021_02_13_429885 68 12 massively massively RB 10_1101-2021_02_13_429885 68 13 impacts impact VBZ 10_1101-2021_02_13_429885 68 14 CNA cna NN 10_1101-2021_02_13_429885 68 15 calling calling NN 10_1101-2021_02_13_429885 68 16 and and CC 10_1101-2021_02_13_429885 68 17 purity purity NN 10_1101-2021_02_13_429885 68 18 inferences inference NNS 10_1101-2021_02_13_429885 68 19 , , , 10_1101-2021_02_13_429885 68 20 propagating propagate VBG 10_1101-2021_02_13_429885 68 21 errors error NNS 10_1101-2021_02_13_429885 68 22 in in IN 10_1101-2021_02_13_429885 68 23 downstream downstream JJ 
10_1101-2021_02_13_429885 68 24 analysis analysis NN 10_1101-2021_02_13_429885 68 25 that that WDT 10_1101-2021_02_13_429885 68 26 eventually eventually RB 10_1101-2021_02_13_429885 68 27 lead lead VBP 10_1101-2021_02_13_429885 68 28 to to IN 10_1101-2021_02_13_429885 68 29 incorrect incorrect JJ 10_1101-2021_02_13_429885 68 30 biological biological JJ 10_1101-2021_02_13_429885 68 31 conclusions conclusion NNS 10_1101-2021_02_13_429885 68 32 , , , 10_1101-2021_02_13_429885 68 33 becoming become VBG 10_1101-2021_02_13_429885 68 34 a a DT 10_1101-2021_02_13_429885 68 35 crucial crucial JJ 10_1101-2021_02_13_429885 68 36 computational computational JJ 10_1101-2021_02_13_429885 68 37 bottleneck bottleneck NN 10_1101-2021_02_13_429885 68 38 in in IN 10_1101-2021_02_13_429885 68 39 the the DT 10_1101-2021_02_13_429885 68 40 era era NN 10_1101-2021_02_13_429885 68 41 of of IN 10_1101-2021_02_13_429885 68 42 high high JJ 10_1101-2021_02_13_429885 68 43 - - HYPH 10_1101-2021_02_13_429885 68 44 resolution resolution NN 10_1101-2021_02_13_429885 68 45 whole whole RB 10_1101-2021_02_13_429885 68 46 - - HYPH 10_1101-2021_02_13_429885 68 47 genome genome RB 10_1101-2021_02_13_429885 68 48 sequencing sequencing NN 10_1101-2021_02_13_429885 68 49 . . . 
10_1101-2021_02_13_429885 69 1 To to TO 10_1101-2021_02_13_429885 69 2 solve solve VB 10_1101-2021_02_13_429885 69 3 these these DT 10_1101-2021_02_13_429885 69 4 problems problem NNS 10_1101-2021_02_13_429885 69 5 we -PRON- PRP 10_1101-2021_02_13_429885 69 6 developed develop VBD 10_1101-2021_02_13_429885 69 7 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 69 8 ( ( -LRB- 10_1101-2021_02_13_429885 69 9 ​Data ​Data NNP 10_1101-2021_02_13_429885 69 10 Availability​ Availability​ NNP 10_1101-2021_02_13_429885 69 11 ) ) -RRB- 10_1101-2021_02_13_429885 69 12 , , , 10_1101-2021_02_13_429885 69 13 a a DT 10_1101-2021_02_13_429885 69 14 computational computational JJ 10_1101-2021_02_13_429885 69 15 framework framework NN 10_1101-2021_02_13_429885 69 16 with with IN 10_1101-2021_02_13_429885 69 17 a a DT 10_1101-2021_02_13_429885 69 18 de de FW 10_1101-2021_02_13_429885 69 19 novo novo NNP 10_1101-2021_02_13_429885 69 20 statistical statistical JJ 10_1101-2021_02_13_429885 69 21 model model NN 10_1101-2021_02_13_429885 69 22 to to TO 10_1101-2021_02_13_429885 69 23 assess assess VB 10_1101-2021_02_13_429885 69 24 the the DT 10_1101-2021_02_13_429885 69 25 conformance conformance NN 10_1101-2021_02_13_429885 69 26 of of IN 10_1101-2021_02_13_429885 69 27 expected expect VBN 10_1101-2021_02_13_429885 69 28 SNVs SNVs NNPS 10_1101-2021_02_13_429885 69 29 , , , 10_1101-2021_02_13_429885 69 30 CNAs cna NNS 10_1101-2021_02_13_429885 69 31 , , , 10_1101-2021_02_13_429885 69 32 and and CC 10_1101-2021_02_13_429885 69 33 purity purity NN 10_1101-2021_02_13_429885 69 34 estimates estimate NNS 10_1101-2021_02_13_429885 69 35 . . . 
10_1101-2021_02_13_429885 70 1 We -PRON- PRP 10_1101-2021_02_13_429885 70 2 strived strive VBD 10_1101-2021_02_13_429885 70 3 to to TO 10_1101-2021_02_13_429885 70 4 make make VB 10_1101-2021_02_13_429885 70 5 the the DT 10_1101-2021_02_13_429885 70 6 tool tool NN 10_1101-2021_02_13_429885 70 7 as as IN 10_1101-2021_02_13_429885 70 8 simple simple JJ 10_1101-2021_02_13_429885 70 9 to to TO 10_1101-2021_02_13_429885 70 10 implement implement VB 10_1101-2021_02_13_429885 70 11 as as IN 10_1101-2021_02_13_429885 70 12 possible possible JJ 10_1101-2021_02_13_429885 70 13 , , , 10_1101-2021_02_13_429885 70 14 maximising maximise VBG 10_1101-2021_02_13_429885 70 15 compatibility compatibility NN 10_1101-2021_02_13_429885 70 16 across across IN 10_1101-2021_02_13_429885 70 17 differing differ VBG 10_1101-2021_02_13_429885 70 18 pipelines pipeline NNS 10_1101-2021_02_13_429885 70 19 . . . 10_1101-2021_02_13_429885 71 1 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 71 2 computes compute VBZ 10_1101-2021_02_13_429885 71 3 a a DT 10_1101-2021_02_13_429885 71 4 quantitative quantitative JJ 10_1101-2021_02_13_429885 71 5 quality quality NN 10_1101-2021_02_13_429885 71 6 check check NN 10_1101-2021_02_13_429885 71 7 ( ( -LRB- 10_1101-2021_02_13_429885 71 8 QC QC NNP 10_1101-2021_02_13_429885 71 9 ) ) -RRB- 10_1101-2021_02_13_429885 71 10 score score VB 10_1101-2021_02_13_429885 71 11 for for IN 10_1101-2021_02_13_429885 71 12 the the DT 10_1101-2021_02_13_429885 71 13 overall overall JJ 10_1101-2021_02_13_429885 71 14 agreement agreement NN 10_1101-2021_02_13_429885 71 15 of of IN 10_1101-2021_02_13_429885 71 16 the the DT 10_1101-2021_02_13_429885 71 17 calls call NNS 10_1101-2021_02_13_429885 71 18 , , , 10_1101-2021_02_13_429885 71 19 which which WDT 10_1101-2021_02_13_429885 71 20 can can MD 10_1101-2021_02_13_429885 71 21 be be VB 10_1101-2021_02_13_429885 71 22 used use VBN 10_1101-2021_02_13_429885 71 23 to to TO 10_1101-2021_02_13_429885 71 24 tune tune VB 
10_1101-2021_02_13_429885 71 25 the the DT 10_1101-2021_02_13_429885 71 26 parameters parameter NNS 10_1101-2021_02_13_429885 71 27 of of IN 10_1101-2021_02_13_429885 71 28 callers caller NNS 10_1101-2021_02_13_429885 71 29 ( ( -LRB- 10_1101-2021_02_13_429885 71 30 e.g. e.g. RB 10_1101-2021_02_13_429885 71 31 , , , 10_1101-2021_02_13_429885 71 32 decrease decrease VB 10_1101-2021_02_13_429885 71 33 purity purity NN 10_1101-2021_02_13_429885 71 34 or or CC 10_1101-2021_02_13_429885 71 35 increase increase VB 10_1101-2021_02_13_429885 71 36 ploidy ploidy NN 10_1101-2021_02_13_429885 71 37 ) ) -RRB- 10_1101-2021_02_13_429885 71 38 , , , 10_1101-2021_02_13_429885 71 39 or or CC 10_1101-2021_02_13_429885 71 40 select select VB 10_1101-2021_02_13_429885 71 41 among among IN 10_1101-2021_02_13_429885 71 42 multiple multiple JJ 10_1101-2021_02_13_429885 71 43 CNA CNA NNP 10_1101-2021_02_13_429885 71 44 profiles profile NNS 10_1101-2021_02_13_429885 71 45 ( ( -LRB- 10_1101-2021_02_13_429885 71 46 e.g. e.g. RB 10_1101-2021_02_13_429885 71 47 , , , 10_1101-2021_02_13_429885 71 48 tetraploid tetraploid NN 10_1101-2021_02_13_429885 71 49 versus versus IN 10_1101-2021_02_13_429885 71 50 diploid diploid NNP 10_1101-2021_02_13_429885 71 51 tumours tumour NNS 10_1101-2021_02_13_429885 71 52 ) ) -RRB- 10_1101-2021_02_13_429885 71 53 until until IN 10_1101-2021_02_13_429885 71 54 a a DT 10_1101-2021_02_13_429885 71 55 fit fit NN 10_1101-2021_02_13_429885 71 56 is be VBZ 10_1101-2021_02_13_429885 71 57 achieved achieve VBN 10_1101-2021_02_13_429885 71 58 . . . 
10_1101-2021_02_13_429885 72 1 In in IN 10_1101-2021_02_13_429885 72 2 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 72 3 we -PRON- PRP 10_1101-2021_02_13_429885 72 4 also also RB 10_1101-2021_02_13_429885 72 5 integrate integrate VBP 10_1101-2021_02_13_429885 72 6 these these DT 10_1101-2021_02_13_429885 72 7 measures measure NNS 10_1101-2021_02_13_429885 72 8 to to TO 10_1101-2021_02_13_429885 72 9 determine determine VB 10_1101-2021_02_13_429885 72 10 CCF ccf NN 10_1101-2021_02_13_429885 72 11 values value NNS 10_1101-2021_02_13_429885 72 12 ( ( -LRB- 10_1101-2021_02_13_429885 72 13 Dentro Dentro NNP 10_1101-2021_02_13_429885 72 14 , , , 10_1101-2021_02_13_429885 72 15 Wedge Wedge NNP 10_1101-2021_02_13_429885 72 16 , , , 10_1101-2021_02_13_429885 72 17 and and CC 10_1101-2021_02_13_429885 72 18 Van Van NNP 10_1101-2021_02_13_429885 72 19 Loo Loo NNP 10_1101-2021_02_13_429885 72 20 2017)​. 2017)​. CD 10_1101-2021_02_13_429885 73 1 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 73 2 is be VBZ 10_1101-2021_02_13_429885 73 3 implemented implement VBN 10_1101-2021_02_13_429885 73 4 as as IN 10_1101-2021_02_13_429885 73 5 a a DT 10_1101-2021_02_13_429885 73 6 highly highly RB 10_1101-2021_02_13_429885 73 7 optimised optimised JJ 10_1101-2021_02_13_429885 73 8 R r NN 10_1101-2021_02_13_429885 73 9 package package NN 10_1101-2021_02_13_429885 73 10 that that WDT 10_1101-2021_02_13_429885 73 11 can can MD 10_1101-2021_02_13_429885 73 12 be be VB 10_1101-2021_02_13_429885 73 13 used use VBN 10_1101-2021_02_13_429885 73 14 downstream downstream JJ 10_1101-2021_02_13_429885 73 15 of of IN 10_1101-2021_02_13_429885 73 16 any any DT 10_1101-2021_02_13_429885 73 17 cancer cancer NN 10_1101-2021_02_13_429885 73 18 mutation mutation NN 10_1101-2021_02_13_429885 73 19 calling calling NN 10_1101-2021_02_13_429885 73 20 pipeline pipeline NN 10_1101-2021_02_13_429885 73 21 . . . 
10_1101-2021_02_13_429885 74 1 It -PRON- PRP 10_1101-2021_02_13_429885 74 2 can can MD 10_1101-2021_02_13_429885 74 3 be be VB 10_1101-2021_02_13_429885 74 4 run run VBN 10_1101-2021_02_13_429885 74 5 on on IN 10_1101-2021_02_13_429885 74 6 WGS WGS NNP 10_1101-2021_02_13_429885 74 7 data datum NNS 10_1101-2021_02_13_429885 74 8 , , , 10_1101-2021_02_13_429885 74 9 and and CC 10_1101-2021_02_13_429885 74 10 can can MD 10_1101-2021_02_13_429885 74 11 automatically automatically RB 10_1101-2021_02_13_429885 74 12 compute compute VB 10_1101-2021_02_13_429885 74 13 a a DT 10_1101-2021_02_13_429885 74 14 QC QC NNP 10_1101-2021_02_13_429885 74 15 score score NN 10_1101-2021_02_13_429885 74 16 in in IN 10_1101-2021_02_13_429885 74 17 a a DT 10_1101-2021_02_13_429885 74 18 matter matter NN 10_1101-2021_02_13_429885 74 19 of of IN 10_1101-2021_02_13_429885 74 20 seconds second NNS 10_1101-2021_02_13_429885 74 21 , , , 10_1101-2021_02_13_429885 74 22 which which WDT 10_1101-2021_02_13_429885 74 23 is be VBZ 10_1101-2021_02_13_429885 74 24 an an DT 10_1101-2021_02_13_429885 74 25 extremely extremely RB 10_1101-2021_02_13_429885 74 26 useful useful JJ 
10_1101-2021_02_13_429885 77 1 feature feature NN 10_1101-2021_02_13_429885 77 2 for for IN 10_1101-2021_02_13_429885 77 3 large large JJ 10_1101-2021_02_13_429885 77 4 - - HYPH 10_1101-2021_02_13_429885 77 5 scale scale NN 10_1101-2021_02_13_429885 77 6 genomics genomic NNS 10_1101-2021_02_13_429885 77 7 consortia consortium NNS 10_1101-2021_02_13_429885 77 8 that that WDT 10_1101-2021_02_13_429885 77 9 analyse analyse VBP 10_1101-2021_02_13_429885 77 10 many many JJ 10_1101-2021_02_13_429885 77 11 samples sample NNS 10_1101-2021_02_13_429885 77 12 per per IN 10_1101-2021_02_13_429885 77 13 day day NN 10_1101-2021_02_13_429885 77 14 . . . 
10_1101-2021_02_13_429885 78 1 To to TO 10_1101-2021_02_13_429885 78 2 demonstrate demonstrate VB 10_1101-2021_02_13_429885 78 3 the the DT 10_1101-2021_02_13_429885 78 4 tool tool NN 10_1101-2021_02_13_429885 78 5 we -PRON- PRP 10_1101-2021_02_13_429885 78 6 analysed analyse VBD 10_1101-2021_02_13_429885 78 7 11 11 CD 10_1101-2021_02_13_429885 78 8 bulk bulk NN 10_1101-2021_02_13_429885 78 9 WGS WGS NNP 10_1101-2021_02_13_429885 78 10 datasets dataset VBZ 10_1101-2021_02_13_429885 78 11 from from IN 10_1101-2021_02_13_429885 78 12 two two CD 10_1101-2021_02_13_429885 78 13 multi multi JJ 10_1101-2021_02_13_429885 78 14 - - JJ 10_1101-2021_02_13_429885 78 15 region region JJ 10_1101-2021_02_13_429885 78 16 colorectal colorectal JJ 10_1101-2021_02_13_429885 78 17 cancers cancer NNS 10_1101-2021_02_13_429885 78 18 , , , 10_1101-2021_02_13_429885 78 19 and and CC 10_1101-2021_02_13_429885 78 20 analysed analyse VBD 10_1101-2021_02_13_429885 78 21 high high JJ 10_1101-2021_02_13_429885 78 22 - - HYPH 10_1101-2021_02_13_429885 78 23 quality quality NN 10_1101-2021_02_13_429885 78 24 whole whole JJ 10_1101-2021_02_13_429885 78 25 - - HYPH 10_1101-2021_02_13_429885 78 26 genomes genome NNS 10_1101-2021_02_13_429885 78 27 from from IN 10_1101-2021_02_13_429885 78 28 the the DT 10_1101-2021_02_13_429885 78 29 Pan Pan NNP 10_1101-2021_02_13_429885 78 30 - - HYPH 10_1101-2021_02_13_429885 78 31 Cancer Cancer NNP 10_1101-2021_02_13_429885 78 32 Analysis Analysis NNP 10_1101-2021_02_13_429885 78 33 of of IN 10_1101-2021_02_13_429885 78 34 Whole Whole NNP 10_1101-2021_02_13_429885 78 35 Genomes Genomes NNP 10_1101-2021_02_13_429885 78 36 ( ( -LRB- 10_1101-2021_02_13_429885 78 37 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 78 38 ) ) -RRB- 10_1101-2021_02_13_429885 78 39 cohort cohort NN 10_1101-2021_02_13_429885 78 40 ​(Campbell ​(Campbell NNP 10_1101-2021_02_13_429885 78 41 et et FW 10_1101-2021_02_13_429885 78 42 al al NNP 10_1101-2021_02_13_429885 78 43 . . . 
10_1101-2021_02_13_429885 79 1 2020)​. 2020)​. CD 10_1101-2021_02_13_429885 80 1 Results result NNS 10_1101-2021_02_13_429885 80 2 The the DT 10_1101-2021_02_13_429885 80 3 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 80 4 framework framework NN 10_1101-2021_02_13_429885 80 5 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 80 6 can can MD 10_1101-2021_02_13_429885 80 7 perform perform VB 10_1101-2021_02_13_429885 80 8 different different JJ 10_1101-2021_02_13_429885 80 9 types type NNS 10_1101-2021_02_13_429885 80 10 of of IN 10_1101-2021_02_13_429885 80 11 operations operation NNS 10_1101-2021_02_13_429885 80 12 on on IN 10_1101-2021_02_13_429885 80 13 CNAs cna NNS 10_1101-2021_02_13_429885 80 14 and and CC 10_1101-2021_02_13_429885 80 15 somatic somatic JJ 10_1101-2021_02_13_429885 80 16 mutation mutation NN 10_1101-2021_02_13_429885 80 17 calls call NNS 10_1101-2021_02_13_429885 80 18 obtained obtain VBN 10_1101-2021_02_13_429885 80 19 from from IN 10_1101-2021_02_13_429885 80 20 bulk bulk NNP 10_1101-2021_02_13_429885 80 21 WGS WGS NNP 10_1101-2021_02_13_429885 80 22 . . . 
10_1101-2021_02_13_429885 81 1 In in IN 10_1101-2021_02_13_429885 81 2 what what WP 10_1101-2021_02_13_429885 81 3 follows follow VBZ 10_1101-2021_02_13_429885 81 4 , , , 10_1101-2021_02_13_429885 81 5 we -PRON- PRP 10_1101-2021_02_13_429885 81 6 will will MD 10_1101-2021_02_13_429885 81 7 refer refer VB 10_1101-2021_02_13_429885 81 8 explicitly explicitly RB 10_1101-2021_02_13_429885 81 9 to to IN 10_1101-2021_02_13_429885 81 10 SNVs SNVs NNPS 10_1101-2021_02_13_429885 81 11 as as IN 10_1101-2021_02_13_429885 81 12 the the DT 10_1101-2021_02_13_429885 81 13 main main JJ 10_1101-2021_02_13_429885 81 14 type type NN 10_1101-2021_02_13_429885 81 15 of of IN 10_1101-2021_02_13_429885 81 16 mutation mutation NN 10_1101-2021_02_13_429885 81 17 used use VBN 10_1101-2021_02_13_429885 81 18 , , , 10_1101-2021_02_13_429885 81 19 but but CC 10_1101-2021_02_13_429885 81 20 in in IN 10_1101-2021_02_13_429885 81 21 principle principle JJ 10_1101-2021_02_13_429885 81 22 other other JJ 10_1101-2021_02_13_429885 81 23 types type NNS 10_1101-2021_02_13_429885 81 24 of of IN 10_1101-2021_02_13_429885 81 25 substitutions substitution NNS 10_1101-2021_02_13_429885 81 26 such such JJ 10_1101-2021_02_13_429885 81 27 as as IN 10_1101-2021_02_13_429885 81 28 insertions insertion NNS 10_1101-2021_02_13_429885 81 29 or or CC 10_1101-2021_02_13_429885 81 30 deletions deletion NNS 10_1101-2021_02_13_429885 81 31 also also RB 10_1101-2021_02_13_429885 81 32 apply apply VBP 10_1101-2021_02_13_429885 81 33 . . . 
10_1101-2021_02_13_429885 82 1 The the DT 10_1101-2021_02_13_429885 82 2 package package NN 10_1101-2021_02_13_429885 82 3 supports support VBZ 10_1101-2021_02_13_429885 82 4 the the DT 10_1101-2021_02_13_429885 82 5 most most RBS 10_1101-2021_02_13_429885 82 6 common common JJ 10_1101-2021_02_13_429885 82 7 CNA cna NN 10_1101-2021_02_13_429885 82 8 copy copy NN 10_1101-2021_02_13_429885 82 9 types type NNS 10_1101-2021_02_13_429885 82 10 found find VBN 10_1101-2021_02_13_429885 82 11 in in IN 10_1101-2021_02_13_429885 82 12 cancers cancer NNS 10_1101-2021_02_13_429885 82 13 : : : 10_1101-2021_02_13_429885 82 14 heterozygous heterozygous JJ 10_1101-2021_02_13_429885 82 15 normal normal JJ 10_1101-2021_02_13_429885 82 16 states state NNS 10_1101-2021_02_13_429885 82 17 ( ( -LRB- 10_1101-2021_02_13_429885 82 18 1:1 1:1 CD 10_1101-2021_02_13_429885 82 19 chromosome chromosome NN 10_1101-2021_02_13_429885 82 20 complement complement NN 10_1101-2021_02_13_429885 82 21 ) ) -RRB- 10_1101-2021_02_13_429885 82 22 , , , 10_1101-2021_02_13_429885 82 23 loss loss NN 10_1101-2021_02_13_429885 82 24 of of IN 10_1101-2021_02_13_429885 82 25 heterozygosity heterozygosity NN 10_1101-2021_02_13_429885 82 26 ( ( -LRB- 10_1101-2021_02_13_429885 82 27 LOH LOH NNP 10_1101-2021_02_13_429885 82 28 ) ) -RRB- 10_1101-2021_02_13_429885 82 29 in in IN 10_1101-2021_02_13_429885 82 30 monosomy monosomy NNP 10_1101-2021_02_13_429885 82 31 ( ( -LRB- 10_1101-2021_02_13_429885 82 32 1:0 1:0 CD 10_1101-2021_02_13_429885 82 33 ) ) -RRB- 10_1101-2021_02_13_429885 82 34 and and CC 10_1101-2021_02_13_429885 82 35 copy copy NN 10_1101-2021_02_13_429885 82 36 - - HYPH 10_1101-2021_02_13_429885 82 37 neutral neutral JJ 10_1101-2021_02_13_429885 82 38 ( ( -LRB- 10_1101-2021_02_13_429885 82 39 2:0 2:0 CD 10_1101-2021_02_13_429885 82 40 ) ) -RRB- 10_1101-2021_02_13_429885 82 41 form form NN 10_1101-2021_02_13_429885 82 42 , , , 10_1101-2021_02_13_429885 82 43 trisomy trisomy JJ 10_1101-2021_02_13_429885 82 44 
( ( -LRB- 10_1101-2021_02_13_429885 82 45 2:1 2:1 CD 10_1101-2021_02_13_429885 82 46 ) ) -RRB- 10_1101-2021_02_13_429885 82 47 or or CC 10_1101-2021_02_13_429885 82 48 tetrasomy tetrasomy NN 10_1101-2021_02_13_429885 82 49 ( ( -LRB- 10_1101-2021_02_13_429885 82 50 2:2 2:2 CD 10_1101-2021_02_13_429885 82 51 ) ) -RRB- 10_1101-2021_02_13_429885 82 52 gains gain NNS 10_1101-2021_02_13_429885 82 53 . . . 10_1101-2021_02_13_429885 83 1 The the DT 10_1101-2021_02_13_429885 83 2 tool tool NN 10_1101-2021_02_13_429885 83 3 also also RB 10_1101-2021_02_13_429885 83 4 works work VBZ 10_1101-2021_02_13_429885 83 5 with with IN 10_1101-2021_02_13_429885 83 6 exome exome JJ 10_1101-2021_02_13_429885 83 7 data datum NNS 10_1101-2021_02_13_429885 83 8 , , , 10_1101-2021_02_13_429885 83 9 but but CC 10_1101-2021_02_13_429885 83 10 the the DT 10_1101-2021_02_13_429885 83 11 reduced reduced JJ 10_1101-2021_02_13_429885 83 12 mutational mutational JJ 10_1101-2021_02_13_429885 83 13 burden burden NN 10_1101-2021_02_13_429885 83 14 can can MD 10_1101-2021_02_13_429885 83 15 , , , 10_1101-2021_02_13_429885 83 16 in in IN 10_1101-2021_02_13_429885 83 17 general general JJ 10_1101-2021_02_13_429885 83 18 , , , 10_1101-2021_02_13_429885 83 19 lower lower VB 10_1101-2021_02_13_429885 83 20 the the DT 10_1101-2021_02_13_429885 83 21 reliability reliability NN 10_1101-2021_02_13_429885 83 22 of of IN 10_1101-2021_02_13_429885 83 23 the the DT 10_1101-2021_02_13_429885 83 24 QC QC NNP 10_1101-2021_02_13_429885 83 25 score score NN 10_1101-2021_02_13_429885 83 26 ( ( -LRB- 10_1101-2021_02_13_429885 83 27 ​Supplementary ​supplementary JJ 10_1101-2021_02_13_429885 83 28 Figure Figure NNP 10_1101-2021_02_13_429885 83 29 S1​ s1​ NN 10_1101-2021_02_13_429885 83 30 ) ) -RRB- 10_1101-2021_02_13_429885 83 31 . . . 
10_1101-2021_02_13_429885 84 1 Many many JJ 10_1101-2021_02_13_429885 84 2 metrics metric NNS 10_1101-2021_02_13_429885 84 3 output output NN 10_1101-2021_02_13_429885 84 4 by by IN 10_1101-2021_02_13_429885 84 5 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 84 6 are be VBP 10_1101-2021_02_13_429885 84 7 derived derive VBN 10_1101-2021_02_13_429885 84 8 from from IN 10_1101-2021_02_13_429885 84 9 the the DT 10_1101-2021_02_13_429885 84 10 link link NN 10_1101-2021_02_13_429885 84 11 between between IN 10_1101-2021_02_13_429885 84 12 copy copy NN 10_1101-2021_02_13_429885 84 13 - - HYPH 10_1101-2021_02_13_429885 84 14 state state NN 10_1101-2021_02_13_429885 84 15 profiles profile NNS 10_1101-2021_02_13_429885 84 16 ( ( -LRB- 10_1101-2021_02_13_429885 84 17 i.e. i.e. FW 10_1101-2021_02_13_429885 84 18 , , , 10_1101-2021_02_13_429885 84 19 the the DT 10_1101-2021_02_13_429885 84 20 copies copy NNS 10_1101-2021_02_13_429885 84 21 of of IN 10_1101-2021_02_13_429885 84 22 the the DT 10_1101-2021_02_13_429885 84 23 major major JJ 10_1101-2021_02_13_429885 84 24 and and CC 10_1101-2021_02_13_429885 84 25 minor minor JJ 10_1101-2021_02_13_429885 84 26 alleles allele NNS 10_1101-2021_02_13_429885 84 27 , , , 10_1101-2021_02_13_429885 84 28 which which WDT 10_1101-2021_02_13_429885 84 29 sum sum VBP 10_1101-2021_02_13_429885 84 30 up up RP 10_1101-2021_02_13_429885 84 31 to to IN 10_1101-2021_02_13_429885 84 32 the the DT 10_1101-2021_02_13_429885 84 33 ploidy ploidy NN 10_1101-2021_02_13_429885 84 34 of of IN 10_1101-2021_02_13_429885 84 35 a a DT 10_1101-2021_02_13_429885 84 36 segment segment NN 10_1101-2021_02_13_429885 84 37 ) ) -RRB- 10_1101-2021_02_13_429885 84 38 and and CC 10_1101-2021_02_13_429885 84 39 allele allele NNP 10_1101-2021_02_13_429885 84 40 frequencies frequency NNS 10_1101-2021_02_13_429885 84 41 that that WDT 10_1101-2021_02_13_429885 84 42 are be VBP 10_1101-2021_02_13_429885 84 43 explicit explicit JJ 10_1101-2021_02_13_429885 84 44 from from IN 
10_1101-2021_02_13_429885 84 45 read read VBN 10_1101-2021_02_13_429885 84 46 counts count NNS 10_1101-2021_02_13_429885 84 47 . . . 10_1101-2021_02_13_429885 85 1 Combinatorial combinatorial JJ 10_1101-2021_02_13_429885 85 2 equations equation NNS 10_1101-2021_02_13_429885 85 3 and and CC 10_1101-2021_02_13_429885 85 4 frequency frequency NN 10_1101-2021_02_13_429885 85 5 spectrum spectrum NN 10_1101-2021_02_13_429885 85 6 analysis analysis NN 10_1101-2021_02_13_429885 85 7 can can MD 10_1101-2021_02_13_429885 85 8 quantitatively quantitatively RB 10_1101-2021_02_13_429885 85 9 determine determine VB 10_1101-2021_02_13_429885 85 10 if if IN 10_1101-2021_02_13_429885 85 11 CNAs cna NNS 10_1101-2021_02_13_429885 85 12 and and CC 10_1101-2021_02_13_429885 85 13 purity purity NN 10_1101-2021_02_13_429885 85 14 are be VBP 10_1101-2021_02_13_429885 85 15 consistent consistent JJ 10_1101-2021_02_13_429885 85 16 with with IN 10_1101-2021_02_13_429885 85 17 the the DT 10_1101-2021_02_13_429885 85 18 VAF VAF NNP 10_1101-2021_02_13_429885 85 19 distribution distribution NN 10_1101-2021_02_13_429885 85 20 ( ( -LRB- 10_1101-2021_02_13_429885 85 21 ​Online ​online JJ 10_1101-2021_02_13_429885 85 22 methods method NNS 10_1101-2021_02_13_429885 85 23 ​ ​ JJ 10_1101-2021_02_13_429885 85 24 ) ) -RRB- 10_1101-2021_02_13_429885 85 25 . . . 
10_1101-2021_02_13_429885 86 1 This this DT 10_1101-2021_02_13_429885 86 2 score score NN 10_1101-2021_02_13_429885 86 3 also also RB 10_1101-2021_02_13_429885 86 4 suggests suggest VBZ 10_1101-2021_02_13_429885 86 5 “ " `` 10_1101-2021_02_13_429885 86 6 corrections correction NNS 10_1101-2021_02_13_429885 86 7 ” " '' 10_1101-2021_02_13_429885 86 8 to to TO 10_1101-2021_02_13_429885 86 9 automatically automatically RB 10_1101-2021_02_13_429885 86 10 fine fine JJ 10_1101-2021_02_13_429885 86 11 - - HYPH 10_1101-2021_02_13_429885 86 12 tune tune NN 10_1101-2021_02_13_429885 86 13 and and CC 10_1101-2021_02_13_429885 86 14 repeat repeat VB 10_1101-2021_02_13_429885 86 15 CNA CNA NNP 10_1101-2021_02_13_429885 86 16 calling calling NN 10_1101-2021_02_13_429885 86 17 runs run NNS 10_1101-2021_02_13_429885 86 18 . . . 10_1101-2021_02_13_429885 87 1 This this DT 10_1101-2021_02_13_429885 87 2 works work VBZ 10_1101-2021_02_13_429885 87 3 for for IN 10_1101-2021_02_13_429885 87 4 tools tool NNS 10_1101-2021_02_13_429885 87 5 that that WDT 10_1101-2021_02_13_429885 87 6 use use VBP 10_1101-2021_02_13_429885 87 7 either either DT 10_1101-2021_02_13_429885 87 8 Bayesian bayesian JJ 10_1101-2021_02_13_429885 87 9 priors prior NNS 10_1101-2021_02_13_429885 87 10 or or CC 10_1101-2021_02_13_429885 87 11 point point NN 10_1101-2021_02_13_429885 87 12 estimates estimate NNS 10_1101-2021_02_13_429885 87 13 of of IN 10_1101-2021_02_13_429885 87 14 the the DT 10_1101-2021_02_13_429885 87 15 parameters parameter NNS 10_1101-2021_02_13_429885 87 16 . . . 
10_1101-2021_02_13_429885 88 1 The the DT 10_1101-2021_02_13_429885 88 2 key key JJ 10_1101-2021_02_13_429885 88 3 equations equation NNS 10_1101-2021_02_13_429885 88 4 for for IN 10_1101-2021_02_13_429885 88 5 a a DT 10_1101-2021_02_13_429885 88 6 somatic somatic JJ 10_1101-2021_02_13_429885 88 7 mutation mutation NN 10_1101-2021_02_13_429885 88 8 link link VB 10_1101-2021_02_13_429885 88 9 its -PRON- PRP$ 10_1101-2021_02_13_429885 88 10 VAF VAF NNP 10_1101-2021_02_13_429885 88 11 and and CC 10_1101-2021_02_13_429885 88 12 CCF CCF NNP 10_1101-2021_02_13_429885 88 13 , , , 10_1101-2021_02_13_429885 88 14 to to TO 10_1101-2021_02_13_429885 88 15 sample sample VB 10_1101-2021_02_13_429885 88 16 purity purity NN 10_1101-2021_02_13_429885 88 17 , , , 10_1101-2021_02_13_429885 88 18 tumour tumour NN 10_1101-2021_02_13_429885 88 19 ploidy ploidy NN 10_1101-2021_02_13_429885 88 20 , , , 10_1101-2021_02_13_429885 88 21 and and CC 10_1101-2021_02_13_429885 88 22 ​ ​ JJ 10_1101-2021_02_13_429885 88 23 , , , 10_1101-2021_02_13_429885 88 24 the the DT 10_1101-2021_02_13_429885 88 25 number number NN 10_1101-2021_02_13_429885 88 26 of of IN 10_1101-2021_02_13_429885 88 27 copies copy NNS 10_1101-2021_02_13_429885 88 28 of of IN 10_1101-2021_02_13_429885 88 29 a a DT 10_1101-2021_02_13_429885 88 30 mutation mutation NN 10_1101-2021_02_13_429885 88 31 ( ( -LRB- 10_1101-2021_02_13_429885 88 32 ​Figure ​figure NN 10_1101-2021_02_13_429885 88 33 1a 1a CD 10_1101-2021_02_13_429885 88 34 ​ ​ JJ 10_1101-2021_02_13_429885 88 35 ) ) -RRB- 10_1101-2021_02_13_429885 88 36 . . . 
10_1101-2021_02_13_429885 89 1 Effectively effectively RB 10_1101-2021_02_13_429885 89 2 , , , 10_1101-2021_02_13_429885 89 3 for for IN 10_1101-2021_02_13_429885 89 4 complex complex JJ 10_1101-2021_02_13_429885 89 5 2:0 2:0 CD 10_1101-2021_02_13_429885 89 6 , , , 10_1101-2021_02_13_429885 89 7 2:1 2:1 CD 10_1101-2021_02_13_429885 89 8 and and CC 10_1101-2021_02_13_429885 89 9 2:2 2:2 CD 10_1101-2021_02_13_429885 89 10 copy copy NN 10_1101-2021_02_13_429885 89 11 states state NNS 10_1101-2021_02_13_429885 89 12 , , , 10_1101-2021_02_13_429885 89 13 phases phase VBZ 10_1101-2021_02_13_429885 89 14 mutations mutation NNS 10_1101-2021_02_13_429885 89 15 that that WDT 10_1101-2021_02_13_429885 89 16 were be VBD 10_1101-2021_02_13_429885 89 17 acquired acquire VBN 10_1101-2021_02_13_429885 89 18 before before IN 10_1101-2021_02_13_429885 89 19 or or CC 10_1101-2021_02_13_429885 89 20 after after IN 10_1101-2021_02_13_429885 89 21 the the DT 10_1101-2021_02_13_429885 89 22 copy copy NN 10_1101-2021_02_13_429885 89 23 number number NN 10_1101-2021_02_13_429885 89 24 event event NN 10_1101-2021_02_13_429885 89 25 ( ( -LRB- 10_1101-2021_02_13_429885 89 26 ​Figure ​figure NN 10_1101-2021_02_13_429885 89 27 1b 1b NN 10_1101-2021_02_13_429885 89 28 ​ ​ JJ 10_1101-2021_02_13_429885 89 29 ) ) -RRB- 10_1101-2021_02_13_429885 89 30 . . . 
10_1101-2021_02_13_429885 90 1 We -PRON- PRP 10_1101-2021_02_13_429885 90 2 remark remark VBP 10_1101-2021_02_13_429885 90 3 that that IN 10_1101-2021_02_13_429885 90 4 we -PRON- PRP 10_1101-2021_02_13_429885 90 5 observe observe VBP 10_1101-2021_02_13_429885 90 6 , , , 10_1101-2021_02_13_429885 90 7 and and CC 10_1101-2021_02_13_429885 90 8 infer infer JJ 10_1101-2021_02_13_429885 90 9 , , , 10_1101-2021_02_13_429885 90 10 ​ ​ JJ 10_1101-2021_02_13_429885 90 11 and and CC 10_1101-2021_02_13_429885 90 12 , , , 10_1101-2021_02_13_429885 90 13 finally finally RB 10_1101-2021_02_13_429885 90 14 deriving derive VBG 10_1101-2021_02_13_429885 90 15 , , , 10_1101-2021_02_13_429885 90 16 which which WDT 10_1101-2021_02_13_429885 90 17 is be VBZ 10_1101-2021_02_13_429885 90 18 difficult difficult JJ 10_1101-2021_02_13_429885 90 19 to to TO 10_1101-2021_02_13_429885 90 20 estimate estimate VB 10_1101-2021_02_13_429885 90 21 ( ( -LRB- 10_1101-2021_02_13_429885 90 22 ​Figure ​figure NN 10_1101-2021_02_13_429885 90 23 1c 1c CD 10_1101-2021_02_13_429885 90 24 ​ ​ NN 10_1101-2021_02_13_429885 90 25 ) ) -RRB- 10_1101-2021_02_13_429885 90 26 . . . 
10_1101-2021_02_13_429885 91 1 In in IN 10_1101-2021_02_13_429885 91 2 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 91 3 we -PRON- PRP 10_1101-2021_02_13_429885 91 4 use use VBP 10_1101-2021_02_13_429885 91 5 the the DT 10_1101-2021_02_13_429885 91 6 following follow VBG 10_1101-2021_02_13_429885 91 7 formula formula NN 10_1101-2021_02_13_429885 91 8 for for IN 10_1101-2021_02_13_429885 91 9 VAF VAF NNP 10_1101-2021_02_13_429885 91 10 ( ( -LRB- 10_1101-2021_02_13_429885 91 11 ​Figure ​figure NN 10_1101-2021_02_13_429885 91 12 1d​ 1d​ CD 10_1101-2021_02_13_429885 91 13 ) ) -RRB- 10_1101-2021_02_13_429885 91 14 and and CC 10_1101-2021_02_13_429885 91 15 CCF CCF NNP 10_1101-2021_02_13_429885 91 16 .CC .CC NFP 10_1101-2021_02_13_429885 91 17 - - : 10_1101-2021_02_13_429885 91 18 BY by IN 10_1101-2021_02_13_429885 91 19 - - HYPH 10_1101-2021_02_13_429885 91 20 NC NC NNP 10_1101-2021_02_13_429885 91 21 - - HYPH 10_1101-2021_02_13_429885 91 22 ND ND NNP 10_1101-2021_02_13_429885 91 23 4.0 4.0 CD 10_1101-2021_02_13_429885 91 24 International International NNP 10_1101-2021_02_13_429885 91 25 licenseavailable licenseavailable NN 10_1101-2021_02_13_429885 91 26 under under IN 10_1101-2021_02_13_429885 91 27 a a DT 10_1101-2021_02_13_429885 91 28 ( ( -LRB- 10_1101-2021_02_13_429885 91 29 which which WDT 10_1101-2021_02_13_429885 91 30 was be VBD 10_1101-2021_02_13_429885 91 31 not not RB 10_1101-2021_02_13_429885 91 32 certified certify VBN 10_1101-2021_02_13_429885 91 33 by by IN 10_1101-2021_02_13_429885 91 34 peer peer NN 10_1101-2021_02_13_429885 91 35 review review NN 10_1101-2021_02_13_429885 91 36 ) ) -RRB- 10_1101-2021_02_13_429885 91 37 is be VBZ 10_1101-2021_02_13_429885 91 38 the the DT 10_1101-2021_02_13_429885 91 39 author author NN 10_1101-2021_02_13_429885 91 40 / / SYM 10_1101-2021_02_13_429885 91 41 funder funder NN 10_1101-2021_02_13_429885 91 42 , , , 10_1101-2021_02_13_429885 91 43 who who WP 10_1101-2021_02_13_429885 91 44 has have VBZ 
10_1101-2021_02_13_429885 91 45 granted grant VBN 10_1101-2021_02_13_429885 91 46 bioRxiv biorxiv IN 10_1101-2021_02_13_429885 91 47 a a DT 10_1101-2021_02_13_429885 91 48 license license NN 10_1101-2021_02_13_429885 91 49 to to TO 10_1101-2021_02_13_429885 91 50 display display VB 10_1101-2021_02_13_429885 91 51 the the DT 10_1101-2021_02_13_429885 91 52 preprint preprint NN 10_1101-2021_02_13_429885 91 53 in in IN 10_1101-2021_02_13_429885 91 54 perpetuity perpetuity NN 10_1101-2021_02_13_429885 91 55 . . . 10_1101-2021_02_13_429885 92 1 It -PRON- PRP 10_1101-2021_02_13_429885 92 2 is be VBZ 10_1101-2021_02_13_429885 92 3 made make VBN 10_1101-2021_02_13_429885 92 4 The the DT 10_1101-2021_02_13_429885 92 5 copyright copyright NN 10_1101-2021_02_13_429885 92 6 holder holder NN 10_1101-2021_02_13_429885 92 7 for for IN 10_1101-2021_02_13_429885 92 8 this this DT 10_1101-2021_02_13_429885 92 9 preprintthis preprintthis NN 10_1101-2021_02_13_429885 92 10 version version NN 10_1101-2021_02_13_429885 92 11 posted post VBD 10_1101-2021_02_13_429885 92 12 February February NNP 10_1101-2021_02_13_429885 92 13 13 13 CD 10_1101-2021_02_13_429885 92 14 , , , 10_1101-2021_02_13_429885 92 15 2021 2021 CD 10_1101-2021_02_13_429885 92 16 . . . 
10_1101-2021_02_13_429885 92 17 ; ; : 10_1101-2021_02_13_429885 92 18 https://doi.org/10.1101/2021.02.13.429885doi https://doi.org/10.1101/2021.02.13.429885doi ADD 10_1101-2021_02_13_429885 92 19 : : : 10_1101-2021_02_13_429885 92 20 bioRxiv biorxiv IN 10_1101-2021_02_13_429885 92 21 preprint preprint NN 10_1101-2021_02_13_429885 92 22 https://paperpile.com/c/rqVmzs/CxXa https://paperpile.com/c/rqVmzs/CxXa NNP 10_1101-2021_02_13_429885 92 23 https://www.codecogs.com/eqnedit.php?latex=v#0 https://www.codecogs.com/eqnedit.php?latex=v#0 NNP 10_1101-2021_02_13_429885 92 24 https://www.codecogs.com/eqnedit.php?latex=c#0 https://www.codecogs.com/eqnedit.php?latex=c#0 NNP 10_1101-2021_02_13_429885 92 25 https://www.codecogs.com/eqnedit.php?latex=%5Cpi#0 https://www.codecogs.com/eqnedit.php?latex=%5cpi#0 NN 10_1101-2021_02_13_429885 92 26 https://www.codecogs.com/eqnedit.php?latex=p#0 https://www.codecogs.com/eqnedit.php?latex=p#0 NNP 10_1101-2021_02_13_429885 92 27 https://www.codecogs.com/eqnedit.php?latex=m%5Cin%5C%7B1%2C2%5C%7D#0 https://www.codecogs.com/eqnedit.php?latex=m%5Cin%5C%7B1%2C2%5C%7D#0 NNP 10_1101-2021_02_13_429885 92 28 https://www.codecogs.com/eqnedit.php?latex=m#0 https://www.codecogs.com/eqnedit.php?latex=m#0 NNP 10_1101-2021_02_13_429885 92 29 https://www.codecogs.com/eqnedit.php?latex=v#0 https://www.codecogs.com/eqnedit.php?latex=v#0 NNP 10_1101-2021_02_13_429885 92 30 https://www.codecogs.com/eqnedit.php?latex=%5Cpi#0 https://www.codecogs.com/eqnedit.php?latex=%5cpi#0 NN 10_1101-2021_02_13_429885 92 31 https://www.codecogs.com/eqnedit.php?latex=p#0 https://www.codecogs.com/eqnedit.php?latex=p#0 NNP 10_1101-2021_02_13_429885 92 32 https://www.codecogs.com/eqnedit.php?latex=m#0 https://www.codecogs.com/eqnedit.php?latex=m#0 NN 10_1101-2021_02_13_429885 92 33 https://www.codecogs.com/eqnedit.php?latex=c#0 https://www.codecogs.com/eqnedit.php?latex=c#0 NNS 10_1101-2021_02_13_429885 92 34 
https://www.codecogs.com/eqnedit.php?latex=v%20%3D%20%5Cdfrac%7B%5Cpi%7D%7B2(1-%5Cpi)%20%2B%20%5Cpi%20p%7D#0 https://www.codecogs.com/eqnedit.php?latex=v%20%3D%20%5Cdfrac%7B%5Cpi%7D%7B2(1-%5Cpi)%20%2B%20%5Cpi%20p%7D#0 NNS 10_1101-2021_02_13_429885 92 35 https://doi.org/10.1101/2021.02.13.429885 https://doi.org/10.1101/2021.02.13.429885 ADD 10_1101-2021_02_13_429885 92 36 http://creativecommons.org/licenses/by-nc-nd/4.0/ http://creativecommons.org/licenses/by-nc-nd/4.0/ CD 
10_1101-2021_02_13_429885 94 1 These these DT 10_1101-2021_02_13_429885 94 2 formulas formula NNS 10_1101-2021_02_13_429885 94 3 lead lead VBP 10_1101-2021_02_13_429885 94 4 to to IN 10_1101-2021_02_13_429885 94 5 other other JJ 10_1101-2021_02_13_429885 94 6 interesting interesting JJ 10_1101-2021_02_13_429885 94 7 quantities quantity NNS 10_1101-2021_02_13_429885 94 8 ( ( -LRB- 10_1101-2021_02_13_429885 94 9 ​Online ​online JJ 10_1101-2021_02_13_429885 94 10 methods method NNS 10_1101-2021_02_13_429885 94 11 ​ ​ JJ 10_1101-2021_02_13_429885 94 12 ) ) -RRB- 10_1101-2021_02_13_429885 94 13 . . . 10_1101-2021_02_13_429885 95 1 For for IN 10_1101-2021_02_13_429885 95 2 instance instance NN 10_1101-2021_02_13_429885 95 3 , , , 10_1101-2021_02_13_429885 95 4 if if IN 10_1101-2021_02_13_429885 95 5 we -PRON- PRP 10_1101-2021_02_13_429885 95 6 know know VBP 10_1101-2021_02_13_429885 95 7 tumour tumour NN 10_1101-2021_02_13_429885 95 8 purity purity NN 10_1101-2021_02_13_429885 95 9 and and CC 10_1101-2021_02_13_429885 95 10 the the DT 10_1101-2021_02_13_429885 95 11 ploidy ploidy NN 10_1101-2021_02_13_429885 95 12 of of IN 10_1101-2021_02_13_429885 95 13 a a DT 10_1101-2021_02_13_429885 95 14 CNA CNA NNP 10_1101-2021_02_13_429885 95 15 segment segment NN 10_1101-2021_02_13_429885 95 16 , , , 10_1101-2021_02_13_429885 95 17 then then RB 10_1101-2021_02_13_429885 95 18 the the DT 10_1101-2021_02_13_429885 95 19 VAF VAF NNP 10_1101-2021_02_13_429885 95 20 mutations mutation NNS 10_1101-2021_02_13_429885 95 21 mapped map VBN 10_1101-2021_02_13_429885 95 22 to to IN 10_1101-2021_02_13_429885 95 23 the the DT 10_1101-2021_02_13_429885 95 24 segment segment NN 10_1101-2021_02_13_429885 95 25 must must MD 10_1101-2021_02_13_429885 95 26 peak peak VB 10_1101-2021_02_13_429885 95 27 at at IN 10_1101-2021_02_13_429885 95 28 a a DT 10_1101-2021_02_13_429885 95 29 known know VBN 10_1101-2021_02_13_429885 95 30 location location NN 10_1101-2021_02_13_429885 95 31 . . . 
10_1101-2021_02_13_429885 96 1 The the DT 10_1101-2021_02_13_429885 96 2 value value NN 10_1101-2021_02_13_429885 96 3 for for IN 10_1101-2021_02_13_429885 96 4 follows follow VBZ 10_1101-2021_02_13_429885 96 5 from from IN 10_1101-2021_02_13_429885 96 6 x x NN 10_1101-2021_02_13_429885 96 7 x x NN 10_1101-2021_02_13_429885 96 8 combinatorial combinatorial JJ 10_1101-2021_02_13_429885 96 9 arguments argument NNS 10_1101-2021_02_13_429885 96 10 relating relate VBG 10_1101-2021_02_13_429885 96 11 all all DT 10_1101-2021_02_13_429885 96 12 other other JJ 10_1101-2021_02_13_429885 96 13 variables variable NNS 10_1101-2021_02_13_429885 96 14 ​(Nik ​(nik CD 10_1101-2021_02_13_429885 96 15 - - HYPH 10_1101-2021_02_13_429885 96 16 Zainal Zainal NNP 10_1101-2021_02_13_429885 96 17 et et NNP 10_1101-2021_02_13_429885 96 18 al al NNP 10_1101-2021_02_13_429885 96 19 . . NNP 10_1101-2021_02_13_429885 96 20 , , , 10_1101-2021_02_13_429885 96 21 2012)​. 2012)​. CD 10_1101-2021_02_13_429885 97 1 From from IN 10_1101-2021_02_13_429885 97 2 a a DT 10_1101-2021_02_13_429885 97 3 QC QC NNP 10_1101-2021_02_13_429885 97 4 perspective perspective NN 10_1101-2021_02_13_429885 97 5 , , , 10_1101-2021_02_13_429885 97 6 the the DT 10_1101-2021_02_13_429885 97 7 euclidean euclidean JJ 10_1101-2021_02_13_429885 97 8 distance distance NN 10_1101-2021_02_13_429885 97 9 between between IN 10_1101-2021_02_13_429885 97 10 the the DT 10_1101-2021_02_13_429885 97 11 theoretical theoretical JJ 10_1101-2021_02_13_429885 97 12 expectation expectation NN 10_1101-2021_02_13_429885 97 13 and and CC 10_1101-2021_02_13_429885 97 14 the the DT 10_1101-2021_02_13_429885 97 15 x x NN 10_1101-2021_02_13_429885 97 16 peaks peak NNS 10_1101-2021_02_13_429885 97 17 observed observe VBN 10_1101-2021_02_13_429885 97 18 from from IN 10_1101-2021_02_13_429885 97 19 data datum NNS 10_1101-2021_02_13_429885 97 20 is be VBZ 10_1101-2021_02_13_429885 97 21 an an DT 10_1101-2021_02_13_429885 97 22 error error NN 
10_1101-2021_02_13_429885 97 23 score score NN 10_1101-2021_02_13_429885 97 24 that that WDT 10_1101-2021_02_13_429885 97 25 approaches approach VBZ 10_1101-2021_02_13_429885 97 26 0 0 CD 10_1101-2021_02_13_429885 97 27 for for IN 10_1101-2021_02_13_429885 97 28 perfect perfect JJ 10_1101-2021_02_13_429885 97 29 calls call NNS 10_1101-2021_02_13_429885 97 30 , , , 10_1101-2021_02_13_429885 97 31 and and CC 10_1101-2021_02_13_429885 97 32 grows grow VBZ 10_1101-2021_02_13_429885 97 33 otherwise otherwise RB 10_1101-2021_02_13_429885 97 34 . . . 10_1101-2021_02_13_429885 98 1 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 98 2 can can MD 10_1101-2021_02_13_429885 98 3 visualise visualise VB 10_1101-2021_02_13_429885 98 4 the the DT 10_1101-2021_02_13_429885 98 5 input input NN 10_1101-2021_02_13_429885 98 6 segments segment NNS 10_1101-2021_02_13_429885 98 7 ( ( -LRB- 10_1101-2021_02_13_429885 98 8 ​Figure ​figure NN 10_1101-2021_02_13_429885 98 9 2a 2a CD 10_1101-2021_02_13_429885 98 10 ​ ​ JJ 10_1101-2021_02_13_429885 98 11 ) ) -RRB- 10_1101-2021_02_13_429885 98 12 and and CC 10_1101-2021_02_13_429885 98 13 read read VBD 10_1101-2021_02_13_429885 98 14 counts count NNS 10_1101-2021_02_13_429885 98 15 ( ( -LRB- 10_1101-2021_02_13_429885 98 16 ​Figure ​figure NN 10_1101-2021_02_13_429885 98 17 2b 2b NNS 10_1101-2021_02_13_429885 98 18 - - HYPH 10_1101-2021_02_13_429885 98 19 d d NN 10_1101-2021_02_13_429885 98 20 ​ ​ NNP 10_1101-2021_02_13_429885 98 21 ) ) -RRB- 10_1101-2021_02_13_429885 98 22 . . . 
10_1101-2021_02_13_429885 99 1 Other other JJ 10_1101-2021_02_13_429885 99 2 analysis analysis NN 10_1101-2021_02_13_429885 99 3 such such JJ 10_1101-2021_02_13_429885 99 4 as as IN 10_1101-2021_02_13_429885 99 5 CCFs ccf NNS 10_1101-2021_02_13_429885 99 6 computation computation NN 10_1101-2021_02_13_429885 99 7 and and CC 10_1101-2021_02_13_429885 99 8 genome genome JJ 10_1101-2021_02_13_429885 99 9 fragmentation fragmentation NN 10_1101-2021_02_13_429885 99 10 analysis analysis NN 10_1101-2021_02_13_429885 99 11 are be VBP 10_1101-2021_02_13_429885 99 12 also also RB 10_1101-2021_02_13_429885 99 13 available available JJ 10_1101-2021_02_13_429885 99 14 , , , 10_1101-2021_02_13_429885 99 15 and and CC 10_1101-2021_02_13_429885 99 16 have have VBP 10_1101-2021_02_13_429885 99 17 other other JJ 10_1101-2021_02_13_429885 99 18 visualisations visualisation NNS 10_1101-2021_02_13_429885 99 19 ( ( -LRB- 10_1101-2021_02_13_429885 99 20 ​Figure ​figure NN 10_1101-2021_02_13_429885 99 21 2e​ 2e​ CD 10_1101-2021_02_13_429885 99 22 ) ) -RRB- 10_1101-2021_02_13_429885 99 23 . . . 
10_1101-2021_02_13_429885 100 1 The the DT 10_1101-2021_02_13_429885 100 2 scores score NNS 10_1101-2021_02_13_429885 100 3 of of IN 10_1101-2021_02_13_429885 100 4 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 100 5 can can MD 10_1101-2021_02_13_429885 100 6 be be VB 10_1101-2021_02_13_429885 100 7 used use VBN 10_1101-2021_02_13_429885 100 8 to to TO 10_1101-2021_02_13_429885 100 9 determine determine VB 10_1101-2021_02_13_429885 100 10 a a DT 10_1101-2021_02_13_429885 100 11 QC QC NNP 10_1101-2021_02_13_429885 100 12 PASS pas NNS 10_1101-2021_02_13_429885 100 13 or or CC 10_1101-2021_02_13_429885 100 14 FAIL FAIL NNP 10_1101-2021_02_13_429885 100 15 status status NN 10_1101-2021_02_13_429885 100 16 for for IN 10_1101-2021_02_13_429885 100 17 every every DT 10_1101-2021_02_13_429885 100 18 copy copy NN 10_1101-2021_02_13_429885 100 19 state state NN 10_1101-2021_02_13_429885 100 20 within within IN 10_1101-2021_02_13_429885 100 21 a a DT 10_1101-2021_02_13_429885 100 22 tumour tumour NN 10_1101-2021_02_13_429885 100 23 genome genome NN 10_1101-2021_02_13_429885 100 24 , , , 10_1101-2021_02_13_429885 100 25 weighting weight VBG 10_1101-2021_02_13_429885 100 26 different different JJ 10_1101-2021_02_13_429885 100 27 evidence evidence NN 10_1101-2021_02_13_429885 100 28 from from IN 10_1101-2021_02_13_429885 100 29 the the DT 10_1101-2021_02_13_429885 100 30 data datum NNS 10_1101-2021_02_13_429885 100 31 . . . 
10_1101-2021_02_13_429885 101 1 One one CD 10_1101-2021_02_13_429885 101 2 score score NN 10_1101-2021_02_13_429885 101 3 is be VBZ 10_1101-2021_02_13_429885 101 4 for for IN 10_1101-2021_02_13_429885 101 5 the the DT 10_1101-2021_02_13_429885 101 6 quality quality NN 10_1101-2021_02_13_429885 101 7 of of IN 10_1101-2021_02_13_429885 101 8 CNA CNA NNP 10_1101-2021_02_13_429885 101 9 segmentation segmentation NN 10_1101-2021_02_13_429885 101 10 and and CC 10_1101-2021_02_13_429885 101 11 tumour tumour NN 10_1101-2021_02_13_429885 101 12 purity purity NN 10_1101-2021_02_13_429885 101 13 , , , 10_1101-2021_02_13_429885 101 14 and and CC 10_1101-2021_02_13_429885 101 15 one one CD 10_1101-2021_02_13_429885 101 16 for for IN 10_1101-2021_02_13_429885 101 17 CCF ccf NN 10_1101-2021_02_13_429885 101 18 values value NNS 10_1101-2021_02_13_429885 101 19 . . . 10_1101-2021_02_13_429885 102 1 The the DT 10_1101-2021_02_13_429885 102 2 former former JJ 10_1101-2021_02_13_429885 102 3 is be VBZ 10_1101-2021_02_13_429885 102 4 based base VBN 10_1101-2021_02_13_429885 102 5 on on IN 10_1101-2021_02_13_429885 102 6 a a DT 10_1101-2021_02_13_429885 102 7 density density NN 10_1101-2021_02_13_429885 102 8 - - HYPH 10_1101-2021_02_13_429885 102 9 based base VBN 10_1101-2021_02_13_429885 102 10 analysis analysis NN 10_1101-2021_02_13_429885 102 11 of of IN 10_1101-2021_02_13_429885 102 12 the the DT 10_1101-2021_02_13_429885 102 13 VAF VAF NNP 10_1101-2021_02_13_429885 102 14 distribution distribution NN 10_1101-2021_02_13_429885 102 15 , , , 10_1101-2021_02_13_429885 102 16 and and CC 10_1101-2021_02_13_429885 102 17 uses use VBZ 10_1101-2021_02_13_429885 102 18 both both CC 10_1101-2021_02_13_429885 102 19 a a DT 10_1101-2021_02_13_429885 102 20 non non JJ 10_1101-2021_02_13_429885 102 21 - - JJ 10_1101-2021_02_13_429885 102 22 parametric parametric JJ 10_1101-2021_02_13_429885 102 23 kernel kernel NN 10_1101-2021_02_13_429885 102 24 density density NN 10_1101-2021_02_13_429885 102 
25 and and CC 10_1101-2021_02_13_429885 102 26 a a DT 10_1101-2021_02_13_429885 102 27 univariate univariate JJ 10_1101-2021_02_13_429885 102 28 Binomial Binomial NNP 10_1101-2021_02_13_429885 102 29 mixture mixture NN 10_1101-2021_02_13_429885 102 30 to to TO 10_1101-2021_02_13_429885 102 31 match match VB 10_1101-2021_02_13_429885 102 32 peaks peak NNS 10_1101-2021_02_13_429885 102 33 in in IN 10_1101-2021_02_13_429885 102 34 the the DT 10_1101-2021_02_13_429885 102 35 VAF VAF NNP 10_1101-2021_02_13_429885 102 36 data datum NNS 10_1101-2021_02_13_429885 102 37 ( ( -LRB- 10_1101-2021_02_13_429885 102 38 ​Figure ​Figure NNP 10_1101-2021_02_13_429885 102 39 3a 3a NNP 10_1101-2021_02_13_429885 102 40 - - HYPH 10_1101-2021_02_13_429885 102 41 d d NNP 10_1101-2021_02_13_429885 102 42 ​ ​ NNP 10_1101-2021_02_13_429885 102 43 ) ) -RRB- 10_1101-2021_02_13_429885 102 44 . . . 10_1101-2021_02_13_429885 103 1 The the DT 10_1101-2021_02_13_429885 103 2 latter latter NN 10_1101-2021_02_13_429885 103 3 is be VBZ 10_1101-2021_02_13_429885 103 4 based base VBN 10_1101-2021_02_13_429885 103 5 on on IN 10_1101-2021_02_13_429885 103 6 the the DT 10_1101-2021_02_13_429885 103 7 entropy entropy NN 10_1101-2021_02_13_429885 103 8 of of IN 10_1101-2021_02_13_429885 103 9 the the DT 10_1101-2021_02_13_429885 103 10 latent latent NN 10_1101-2021_02_13_429885 103 11 variables variable NNS 10_1101-2021_02_13_429885 103 12 in in IN 10_1101-2021_02_13_429885 103 13 a a DT 10_1101-2021_02_13_429885 103 14 Binomial Binomial NNP 10_1101-2021_02_13_429885 103 15 mixture mixture NN 10_1101-2021_02_13_429885 103 16 model model NN 10_1101-2021_02_13_429885 103 17 , , , 10_1101-2021_02_13_429885 103 18 whose whose WP$ 10_1101-2021_02_13_429885 103 19 components component NNS 10_1101-2021_02_13_429885 103 20 are be VBP 10_1101-2021_02_13_429885 103 21 peaked peak VBN 10_1101-2021_02_13_429885 103 22 at at IN 10_1101-2021_02_13_429885 103 23 the the DT 10_1101-2021_02_13_429885 103 24 expected expect 
VBN 10_1101-2021_02_13_429885 103 25 VAF VAF NNP 10_1101-2021_02_13_429885 103 26 . . . 10_1101-2021_02_13_429885 104 1 From from IN 10_1101-2021_02_13_429885 104 2 this this DT 10_1101-2021_02_13_429885 104 3 density density NN 10_1101-2021_02_13_429885 104 4 we -PRON- PRP 10_1101-2021_02_13_429885 104 5 identify identify VBP 10_1101-2021_02_13_429885 104 6 VAF VAF NNP 10_1101-2021_02_13_429885 104 7 ranges range VBZ 10_1101-2021_02_13_429885 104 8 for for IN 10_1101-2021_02_13_429885 104 9 which which WDT 10_1101-2021_02_13_429885 104 10 it -PRON- PRP 10_1101-2021_02_13_429885 104 11 is be VBZ 10_1101-2021_02_13_429885 104 12 hard hard JJ 10_1101-2021_02_13_429885 104 13 to to TO 10_1101-2021_02_13_429885 104 14 estimate estimate VB 10_1101-2021_02_13_429885 104 15 the the DT 10_1101-2021_02_13_429885 104 16 mutation mutation NN 10_1101-2021_02_13_429885 104 17 multiplicity multiplicity NN 10_1101-2021_02_13_429885 104 18 , , , 10_1101-2021_02_13_429885 104 19 and and CC 10_1101-2021_02_13_429885 104 20 therefore therefore RB 10_1101-2021_02_13_429885 104 21 the the DT 10_1101-2021_02_13_429885 104 22 CCF ccf NN 10_1101-2021_02_13_429885 104 23 of of IN 10_1101-2021_02_13_429885 104 24 the the DT 10_1101-2021_02_13_429885 104 25 mutation mutation NN 10_1101-2021_02_13_429885 104 26 ( ( -LRB- 10_1101-2021_02_13_429885 104 27 ​Figure ​Figure NNP 10_1101-2021_02_13_429885 104 28 3e 3e NN 10_1101-2021_02_13_429885 104 29 - - HYPH 10_1101-2021_02_13_429885 104 30 h h NN 10_1101-2021_02_13_429885 104 31 ​ ​ NN 10_1101-2021_02_13_429885 104 32 ) ) -RRB- 10_1101-2021_02_13_429885 104 33 . . . 
10_1101-2021_02_13_429885 105 1 To to IN 10_1101-2021_02_13_429885 105 2 the the DT 10_1101-2021_02_13_429885 105 3 best good JJS 10_1101-2021_02_13_429885 105 4 of of IN 10_1101-2021_02_13_429885 105 5 our -PRON- PRP$ 10_1101-2021_02_13_429885 105 6 understanding understanding NN 10_1101-2021_02_13_429885 105 7 , , , 10_1101-2021_02_13_429885 105 8 this this DT 10_1101-2021_02_13_429885 105 9 is be VBZ 10_1101-2021_02_13_429885 105 10 the the DT 10_1101-2021_02_13_429885 105 11 only only JJ 10_1101-2021_02_13_429885 105 12 framework framework NN 10_1101-2021_02_13_429885 105 13 providing provide VBG 10_1101-2021_02_13_429885 105 14 quantitative quantitative JJ 10_1101-2021_02_13_429885 105 15 metrics metric NNS 10_1101-2021_02_13_429885 105 16 for for IN 10_1101-2021_02_13_429885 105 17 all all PDT 10_1101-2021_02_13_429885 105 18 the the DT 10_1101-2021_02_13_429885 105 19 most most RBS 10_1101-2021_02_13_429885 105 20 widespread widespread JJ 10_1101-2021_02_13_429885 105 21 types type NNS 10_1101-2021_02_13_429885 105 22 of of IN 10_1101-2021_02_13_429885 105 23 tumour tumour NN 10_1101-2021_02_13_429885 105 24 mutations mutation NNS 10_1101-2021_02_13_429885 105 25 . . . 
10_1101-2021_02_13_429885 106 1 Multi multi JJ 10_1101-2021_02_13_429885 106 2 - - JJ 10_1101-2021_02_13_429885 106 3 region region JJ 10_1101-2021_02_13_429885 106 4 colorectal colorectal JJ 10_1101-2021_02_13_429885 106 5 cancer cancer NN 10_1101-2021_02_13_429885 106 6 data datum NNS 10_1101-2021_02_13_429885 106 7 We -PRON- PRP 10_1101-2021_02_13_429885 106 8 have have VBP 10_1101-2021_02_13_429885 106 9 run run VBN 10_1101-2021_02_13_429885 106 10 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 106 11 on on IN 10_1101-2021_02_13_429885 106 12 previously previously RB 10_1101-2021_02_13_429885 106 13 published publish VBN 10_1101-2021_02_13_429885 106 14 WGS WGS NNP 10_1101-2021_02_13_429885 106 15 multi multi JJ 10_1101-2021_02_13_429885 106 16 - - JJ 10_1101-2021_02_13_429885 106 17 region region JJ 10_1101-2021_02_13_429885 106 18 data datum NNS 10_1101-2021_02_13_429885 106 19 ​(Cross ​(Cross NNP 10_1101-2021_02_13_429885 106 20 et et NNP 10_1101-2021_02_13_429885 106 21 al al NNP 10_1101-2021_02_13_429885 106 22 . . . 10_1101-2021_02_13_429885 107 1 10 10 CD 10_1101-2021_02_13_429885 107 2 2018 2018 CD 10_1101-2021_02_13_429885 107 3 ; ; : 10_1101-2021_02_13_429885 107 4 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 107 5 et et NNP 10_1101-2021_02_13_429885 107 6 al al NNP 10_1101-2021_02_13_429885 107 7 . . . 
10_1101-2021_02_13_429885 108 1 2020)​ 2020)​ CD 10_1101-2021_02_13_429885 108 2 , , , 10_1101-2021_02_13_429885 108 3 which which WDT 10_1101-2021_02_13_429885 108 4 was be VBD 10_1101-2021_02_13_429885 108 5 collected collect VBN 10_1101-2021_02_13_429885 108 6 from from IN 10_1101-2021_02_13_429885 108 7 multiple multiple JJ 10_1101-2021_02_13_429885 108 8 regions region NNS 10_1101-2021_02_13_429885 108 9 of of IN 10_1101-2021_02_13_429885 108 10 primary primary JJ 10_1101-2021_02_13_429885 108 11 colorectal colorectal JJ 10_1101-2021_02_13_429885 108 12 adenocarcinomas adenocarcinomas NNP 10_1101-2021_02_13_429885 108 13 across across IN 10_1101-2021_02_13_429885 108 14 two two CD 10_1101-2021_02_13_429885 108 15 distinct distinct JJ 10_1101-2021_02_13_429885 108 16 patients patient NNS 10_1101-2021_02_13_429885 108 17 . . . 10_1101-2021_02_13_429885 109 1 For for IN 10_1101-2021_02_13_429885 109 2 all all PDT 10_1101-2021_02_13_429885 109 3 these these DT 10_1101-2021_02_13_429885 109 4 samples sample NNS 10_1101-2021_02_13_429885 109 5 we -PRON- PRP 10_1101-2021_02_13_429885 109 6 have have VBP 10_1101-2021_02_13_429885 109 7 high high JJ 10_1101-2021_02_13_429885 109 8 quality quality NN 10_1101-2021_02_13_429885 109 9 somatic somatic JJ 10_1101-2021_02_13_429885 109 10 mutation mutation NN 10_1101-2021_02_13_429885 109 11 calls call NNS 10_1101-2021_02_13_429885 109 12 ​(Cross ​(Cross NNP 10_1101-2021_02_13_429885 109 13 et et NNP 10_1101-2021_02_13_429885 109 14 al al NNP 10_1101-2021_02_13_429885 109 15 . . . 
10_1101-2021_02_13_429885 110 1 10 10 CD 10_1101-2021_02_13_429885 110 2 2018 2018 CD 10_1101-2021_02_13_429885 110 3 ) ) -RRB- 10_1101-2021_02_13_429885 110 4 that that WDT 10_1101-2021_02_13_429885 110 5 were be VBD 10_1101-2021_02_13_429885 110 6 obtained obtain VBN 10_1101-2021_02_13_429885 110 7 using use VBG 10_1101-2021_02_13_429885 110 8 CloneHD CloneHD NNP 10_1101-2021_02_13_429885 110 9 ​(Fischer ​(Fischer NNP 10_1101-2021_02_13_429885 110 10 et et NNP 10_1101-2021_02_13_429885 110 11 al al NNP 10_1101-2021_02_13_429885 110 12 . . . 10_1101-2021_02_13_429885 111 1 2014)​. 2014)​. LS 10_1101-2021_02_13_429885 112 1 We -PRON- PRP 10_1101-2021_02_13_429885 112 2 have have VBP 10_1101-2021_02_13_429885 112 3 re re VBN 10_1101-2021_02_13_429885 112 4 - - VBN 10_1101-2021_02_13_429885 112 5 called call VBN 10_1101-2021_02_13_429885 112 6 CNAs cna NNS 10_1101-2021_02_13_429885 112 7 with with IN 10_1101-2021_02_13_429885 112 8 the the DT 10_1101-2021_02_13_429885 112 9 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 112 10 CNA CNA NNP 10_1101-2021_02_13_429885 112 11 caller caller NN 10_1101-2021_02_13_429885 112 12 ( ( -LRB- 10_1101-2021_02_13_429885 112 13 Favero Favero NNP 10_1101-2021_02_13_429885 112 14 et et NNP 10_1101-2021_02_13_429885 112 15 al al NNP 10_1101-2021_02_13_429885 112 16 . . . 
10_1101-2021_02_13_429885 113 1 2015)​ 2015)​ CD 10_1101-2021_02_13_429885 113 2 , , , 10_1101-2021_02_13_429885 113 3 and and CC 10_1101-2021_02_13_429885 113 4 sought seek VBD 10_1101-2021_02_13_429885 113 5 out out RP 10_1101-2021_02_13_429885 113 6 to to TO 10_1101-2021_02_13_429885 113 7 check check VB 10_1101-2021_02_13_429885 113 8 the the DT 10_1101-2021_02_13_429885 113 9 inferred inferred JJ 10_1101-2021_02_13_429885 113 10 copy copy NN 10_1101-2021_02_13_429885 113 11 states state NNS 10_1101-2021_02_13_429885 113 12 and and CC 10_1101-2021_02_13_429885 113 13 tumour tumour NN 10_1101-2021_02_13_429885 113 14 purity purity NN 10_1101-2021_02_13_429885 113 15 with with IN 10_1101-2021_02_13_429885 113 16 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 113 17 , , , 10_1101-2021_02_13_429885 113 18 along along IN 10_1101-2021_02_13_429885 113 19 with with IN 10_1101-2021_02_13_429885 113 20 SNVs snv NNS 10_1101-2021_02_13_429885 113 21 generated generate VBN 10_1101-2021_02_13_429885 113 22 by by IN 10_1101-2021_02_13_429885 113 23 Mutect2 Mutect2 NNP 10_1101-2021_02_13_429885 113 24 ​(Benjamin ​(Benjamin NNP 10_1101-2021_02_13_429885 113 25 et et NNP 10_1101-2021_02_13_429885 113 26 al al NNP 10_1101-2021_02_13_429885 113 27 . . . 10_1101-2021_02_13_429885 114 1 2019)​. 2019)​. 
CD 10_1101-2021_02_13_429885 115 1 .CC .CC NFP 10_1101-2021_02_13_429885 115 2 - - : 10_1101-2021_02_13_429885 115 3 BY by IN 10_1101-2021_02_13_429885 115 4 - - HYPH 10_1101-2021_02_13_429885 115 5 NC NC NNP 10_1101-2021_02_13_429885 115 6 - - HYPH 10_1101-2021_02_13_429885 115 7 ND ND NNP 10_1101-2021_02_13_429885 115 8 4.0 4.0 CD 10_1101-2021_02_13_429885 115 9 International International NNP 10_1101-2021_02_13_429885 115 10 licenseavailable licenseavailable NN 10_1101-2021_02_13_429885 115 11 under under IN 10_1101-2021_02_13_429885 115 12 a a DT 10_1101-2021_02_13_429885 115 13 ( ( -LRB- 10_1101-2021_02_13_429885 115 14 which which WDT 10_1101-2021_02_13_429885 115 15 was be VBD 10_1101-2021_02_13_429885 115 16 not not RB 10_1101-2021_02_13_429885 115 17 certified certify VBN 10_1101-2021_02_13_429885 115 18 by by IN 10_1101-2021_02_13_429885 115 19 peer peer NN 10_1101-2021_02_13_429885 115 20 review review NN 10_1101-2021_02_13_429885 115 21 ) ) -RRB- 10_1101-2021_02_13_429885 115 22 is be VBZ 10_1101-2021_02_13_429885 115 23 the the DT 10_1101-2021_02_13_429885 115 24 author author NN 10_1101-2021_02_13_429885 115 25 / / SYM 10_1101-2021_02_13_429885 115 26 funder funder NN 10_1101-2021_02_13_429885 115 27 , , , 10_1101-2021_02_13_429885 115 28 who who WP 10_1101-2021_02_13_429885 115 29 has have VBZ 10_1101-2021_02_13_429885 115 30 granted grant VBN 10_1101-2021_02_13_429885 115 31 bioRxiv biorxiv IN 10_1101-2021_02_13_429885 115 32 a a DT 10_1101-2021_02_13_429885 115 33 license license NN 10_1101-2021_02_13_429885 115 34 to to TO 10_1101-2021_02_13_429885 115 35 display display VB 10_1101-2021_02_13_429885 115 36 the the DT 10_1101-2021_02_13_429885 115 37 preprint preprint NN 10_1101-2021_02_13_429885 115 38 in in IN 10_1101-2021_02_13_429885 115 39 perpetuity perpetuity NN 10_1101-2021_02_13_429885 115 40 . . . 
10_1101-2021_02_13_429885 116 1 It -PRON- PRP 10_1101-2021_02_13_429885 116 2 is be VBZ 10_1101-2021_02_13_429885 116 3 made make VBN 10_1101-2021_02_13_429885 116 4 The the DT 10_1101-2021_02_13_429885 116 5 copyright copyright NN 10_1101-2021_02_13_429885 116 6 holder holder NN 10_1101-2021_02_13_429885 116 7 for for IN 10_1101-2021_02_13_429885 116 8 this this DT 10_1101-2021_02_13_429885 116 9 preprintthis preprintthis NN 10_1101-2021_02_13_429885 116 10 version version NN 10_1101-2021_02_13_429885 116 11 posted post VBD 10_1101-2021_02_13_429885 116 12 February February NNP 10_1101-2021_02_13_429885 116 13 13 13 CD 10_1101-2021_02_13_429885 116 14 , , , 10_1101-2021_02_13_429885 116 15 2021 2021 CD 10_1101-2021_02_13_429885 116 16 . . . 10_1101-2021_02_13_429885 116 17 ; ; : 10_1101-2021_02_13_429885 116 18 https://doi.org/10.1101/2021.02.13.429885doi https://doi.org/10.1101/2021.02.13.429885doi ADD 10_1101-2021_02_13_429885 116 19 : : : 10_1101-2021_02_13_429885 116 20 bioRxiv biorxiv VB 10_1101-2021_02_13_429885 116 21 preprint preprint NN 10_1101-2021_02_13_429885 116 22 https://www.codecogs.com/eqnedit.php?latex=c%20%3D%20%5Cdfrac%7Bv%5B(p-2)%5Cpi%20%2B%202%5D%7D%7Bm%5Cpi%7D%20%5C%2C%20.#0 https://www.codecogs.com/eqnedit.php?latex=c%20%3d%20%5cdfrac%7bv%5b(p-2)%5cpi%20%2b%202%5d%7d%7bm%5cpi%7d%20%5c%2c%20.#0 NN 10_1101-2021_02_13_429885 116 23 https://www.zotero.org/google-docs/?YaP3DC https://www.zotero.org/google-docs/?yap3dc ADD 10_1101-2021_02_13_429885 116 24 https://paperpile.com/c/rqVmzs/IC0y+chqB https://paperpile.com/c/rqvmzs/ic0y+chqb ADD 10_1101-2021_02_13_429885 116 25 https://paperpile.com/c/rqVmzs/IC0y+chqB https://paperpile.com/c/rqvmzs/ic0y+chqb NN 10_1101-2021_02_13_429885 116 26 https://paperpile.com/c/rqVmzs/IC0y https://paperpile.com/c/rqvmzs/ic0y UH 10_1101-2021_02_13_429885 116 27 https://paperpile.com/c/rqVmzs/A7Vg https://paperpile.com/c/rqVmzs/A7Vg NNP 10_1101-2021_02_13_429885 116 28 https://paperpile.com/c/rqVmzs/tCb6 
https://paperpile.com/c/rqvmzs/tcb6 UH 10_1101-2021_02_13_429885 116 29 https://paperpile.com/c/rqVmzs/bD5o https://paperpile.com/c/rqVmzs/bD5o NNP 10_1101-2021_02_13_429885 116 30 https://doi.org/10.1101/2021.02.13.429885 https://doi.org/10.1101/2021.02.13.429885 VBP 10_1101-2021_02_13_429885 116 31 http://creativecommons.org/licenses/by-nc-nd/4.0/ http://creativecommons.org/licenses/by-nc-nd/4.0/ CD 10_1101-2021_02_13_429885 116 32 Househam Househam NNP 10_1101-2021_02_13_429885 116 33 et et FW 10_1101-2021_02_13_429885 116 34 al al NNP 10_1101-2021_02_13_429885 116 35 . . . 10_1101-2021_02_13_429885 117 1 A a DT 10_1101-2021_02_13_429885 117 2 fully fully RB 10_1101-2021_02_13_429885 117 3 automated automate VBN 10_1101-2021_02_13_429885 117 4 approach approach NN 10_1101-2021_02_13_429885 117 5 for for IN 10_1101-2021_02_13_429885 117 6 quality quality NN 10_1101-2021_02_13_429885 117 7 control control NN 10_1101-2021_02_13_429885 117 8 of of IN 10_1101-2021_02_13_429885 117 9 cancer cancer NN 10_1101-2021_02_13_429885 117 10 mutations mutation NNS 10_1101-2021_02_13_429885 117 11 in in IN 10_1101-2021_02_13_429885 117 12 the the DT 10_1101-2021_02_13_429885 117 13 era era NN 10_1101-2021_02_13_429885 117 14 of of IN 10_1101-2021_02_13_429885 117 15 high high JJ 10_1101-2021_02_13_429885 117 16 - - HYPH 10_1101-2021_02_13_429885 117 17 resolution resolution NN 10_1101-2021_02_13_429885 117 18 whole whole JJ 10_1101-2021_02_13_429885 117 19 genome genome JJ 10_1101-2021_02_13_429885 117 20 sequencing sequencing NN 10_1101-2021_02_13_429885 117 21 . . . 10_1101-2021_02_13_429885 118 1 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 118 2 was be VBD 10_1101-2021_02_13_429885 118 3 run run VBN 10_1101-2021_02_13_429885 118 4 using use VBG 10_1101-2021_02_13_429885 118 5 distinct distinct JJ 10_1101-2021_02_13_429885 118 6 parameterizations parameterization NNS 10_1101-2021_02_13_429885 118 7 . . . 
10_1101-2021_02_13_429885 119 1 We -PRON- PRP 10_1101-2021_02_13_429885 119 2 begun begin VBD 10_1101-2021_02_13_429885 119 3 with with IN 10_1101-2021_02_13_429885 119 4 the the DT 10_1101-2021_02_13_429885 119 5 default default NN 10_1101-2021_02_13_429885 119 6 range range NN 10_1101-2021_02_13_429885 119 7 proposals proposal NNS 10_1101-2021_02_13_429885 119 8 for for IN 10_1101-2021_02_13_429885 119 9 purity purity NN 10_1101-2021_02_13_429885 119 10 and and CC 10_1101-2021_02_13_429885 119 11 ploidy ploidy NN 10_1101-2021_02_13_429885 119 12 , , , 10_1101-2021_02_13_429885 119 13 which which WDT 10_1101-2021_02_13_429885 119 14 we -PRON- PRP 10_1101-2021_02_13_429885 119 15 then then RB 10_1101-2021_02_13_429885 119 16 improved improve VBD 10_1101-2021_02_13_429885 119 17 in in IN 10_1101-2021_02_13_429885 119 18 a a DT 10_1101-2021_02_13_429885 119 19 final final JJ 10_1101-2021_02_13_429885 119 20 run run NN 10_1101-2021_02_13_429885 119 21 following follow VBG 10_1101-2021_02_13_429885 119 22 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 119 23 1 1 CD 10_1101-2021_02_13_429885 119 24 analysis analysis NN 10_1101-2021_02_13_429885 119 25 . . . 
10_1101-2021_02_13_429885 120 1 We -PRON- PRP 10_1101-2021_02_13_429885 120 2 also also RB 10_1101-2021_02_13_429885 120 3 forced force VBD 10_1101-2021_02_13_429885 120 4 a a DT 10_1101-2021_02_13_429885 120 5 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 120 6 fit fit NN 10_1101-2021_02_13_429885 120 7 with with IN 10_1101-2021_02_13_429885 120 8 constrained constrained JJ 10_1101-2021_02_13_429885 120 9 tetraploid tetraploid NN 10_1101-2021_02_13_429885 120 10 genome genome JJ 10_1101-2021_02_13_429885 120 11 ( ( -LRB- 10_1101-2021_02_13_429885 120 12 ploidy ploidy NN 10_1101-2021_02_13_429885 120 13 equal equal NNP 10_1101-2021_02_13_429885 120 14 4 4 CD 10_1101-2021_02_13_429885 120 15 ) ) -RRB- 10_1101-2021_02_13_429885 120 16 , , , 10_1101-2021_02_13_429885 120 17 and and CC 10_1101-2021_02_13_429885 120 18 one one CD 10_1101-2021_02_13_429885 120 19 with with IN 10_1101-2021_02_13_429885 120 20 low low JJ 10_1101-2021_02_13_429885 120 21 purity purity NN 10_1101-2021_02_13_429885 120 22 . . . 
10_1101-2021_02_13_429885 121 1 All all PDT 10_1101-2021_02_13_429885 121 2 these these DT 10_1101-2021_02_13_429885 121 3 steps step NNS 10_1101-2021_02_13_429885 121 4 could could MD 10_1101-2021_02_13_429885 121 5 have have VB 10_1101-2021_02_13_429885 121 6 been be VBN 10_1101-2021_02_13_429885 121 7 easily easily RB 10_1101-2021_02_13_429885 121 8 automatised automatise VBN 10_1101-2021_02_13_429885 121 9 in in IN 10_1101-2021_02_13_429885 121 10 a a DT 10_1101-2021_02_13_429885 121 11 procedure procedure NN 10_1101-2021_02_13_429885 121 12 that that WDT 10_1101-2021_02_13_429885 121 13 runs run VBZ 10_1101-2021_02_13_429885 121 14 the the DT 10_1101-2021_02_13_429885 121 15 caller caller NN 10_1101-2021_02_13_429885 121 16 , , , 10_1101-2021_02_13_429885 121 17 obtains obtain NNS 10_1101-2021_02_13_429885 121 18 score score VBP 10_1101-2021_02_13_429885 121 19 metrics metric NNS 10_1101-2021_02_13_429885 121 20 for for IN 10_1101-2021_02_13_429885 121 21 the the DT 10_1101-2021_02_13_429885 121 22 solution solution NN 10_1101-2021_02_13_429885 121 23 from from IN 10_1101-2021_02_13_429885 121 24 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 121 25 , , , 10_1101-2021_02_13_429885 121 26 and and CC 10_1101-2021_02_13_429885 121 27 re re VB 10_1101-2021_02_13_429885 121 28 - - VB 10_1101-2021_02_13_429885 121 29 run run VB 10_1101-2021_02_13_429885 121 30 the the DT 10_1101-2021_02_13_429885 121 31 fits fit NNS 10_1101-2021_02_13_429885 121 32 with with IN 10_1101-2021_02_13_429885 121 33 adjusted adjusted JJ 10_1101-2021_02_13_429885 121 34 parameters parameter NNS 10_1101-2021_02_13_429885 121 35 if if IN 10_1101-2021_02_13_429885 121 36 required require VBN 10_1101-2021_02_13_429885 121 37 . . . 
10_1101-2021_02_13_429885 122 1 The the DT 10_1101-2021_02_13_429885 122 2 results result NNS 10_1101-2021_02_13_429885 122 3 for for IN 10_1101-2021_02_13_429885 122 4 one one CD 10_1101-2021_02_13_429885 122 5 sample sample NN 10_1101-2021_02_13_429885 122 6 of of IN 10_1101-2021_02_13_429885 122 7 patient patient JJ 10_1101-2021_02_13_429885 122 8 Set7 Set7 NNP 10_1101-2021_02_13_429885 122 9 - - HYPH 10_1101-2021_02_13_429885 122 10 Cancer Cancer NNP 10_1101-2021_02_13_429885 122 11 7 7 CD 10_1101-2021_02_13_429885 122 12 in in IN 10_1101-2021_02_13_429885 122 13 the the DT 10_1101-2021_02_13_429885 122 14 original original JJ 10_1101-2021_02_13_429885 122 15 manuscript manuscript NN 10_1101-2021_02_13_429885 122 16 ​(Cross ​(Cross NNP 10_1101-2021_02_13_429885 122 17 et et NNP 10_1101-2021_02_13_429885 122 18 al al NNP 10_1101-2021_02_13_429885 122 19 . . . 10_1101-2021_02_13_429885 123 1 10 10 CD 10_1101-2021_02_13_429885 123 2 2018 2018 CD 10_1101-2021_02_13_429885 123 3 ) ) -RRB- 10_1101-2021_02_13_429885 123 4 - - : 10_1101-2021_02_13_429885 123 5 are be VBP 10_1101-2021_02_13_429885 123 6 in in IN 10_1101-2021_02_13_429885 123 7 ​Figure ​figure NN 10_1101-2021_02_13_429885 123 8 4 4 CD 10_1101-2021_02_13_429885 123 9 ​ ​ JJ 10_1101-2021_02_13_429885 123 10 ; ; : 10_1101-2021_02_13_429885 123 11 the the DT 10_1101-2021_02_13_429885 123 12 other other JJ 10_1101-2021_02_13_429885 123 13 samples sample NNS 10_1101-2021_02_13_429885 123 14 for for IN 10_1101-2021_02_13_429885 123 15 patient patient NN 10_1101-2021_02_13_429885 123 16 Set7 Set7 NNP 10_1101-2021_02_13_429885 123 17 are be VBP 10_1101-2021_02_13_429885 123 18 in in IN 10_1101-2021_02_13_429885 123 19 ​Supplementary ​Supplementary NNP 10_1101-2021_02_13_429885 123 20 Figures Figures NNPS 10_1101-2021_02_13_429885 123 21 S2-S4 S2-S4 NNP 10_1101-2021_02_13_429885 123 22 ​. ​. 
NN 10_1101-2021_02_13_429885 124 1 All all DT 10_1101-2021_02_13_429885 124 2 samples sample NNS 10_1101-2021_02_13_429885 124 3 for for IN 10_1101-2021_02_13_429885 124 4 patient patient NN 10_1101-2021_02_13_429885 124 5 Set6 Set6 NNP 10_1101-2021_02_13_429885 124 6 are be VBP 10_1101-2021_02_13_429885 124 7 in in IN 10_1101-2021_02_13_429885 124 8 ​Supplementary ​supplementary JJ 10_1101-2021_02_13_429885 124 9 Figures figure NNS 10_1101-2021_02_13_429885 124 10 S5-S10 S5-S10 NNP 10_1101-2021_02_13_429885 124 11 . . . 10_1101-2021_02_13_429885 125 1 The the DT 10_1101-2021_02_13_429885 125 2 peak peak NN 10_1101-2021_02_13_429885 125 3 detection detection NN 10_1101-2021_02_13_429885 125 4 scores score NNS 10_1101-2021_02_13_429885 125 5 produced produce VBN 10_1101-2021_02_13_429885 125 6 by by IN 10_1101-2021_02_13_429885 125 7 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 125 8 invariably invariably RB 10_1101-2021_02_13_429885 125 9 fail fail VBP 10_1101-2021_02_13_429885 125 10 both both CC 10_1101-2021_02_13_429885 125 11 the the DT 10_1101-2021_02_13_429885 125 12 tetraploid tetraploid NN 10_1101-2021_02_13_429885 125 13 and and CC 10_1101-2021_02_13_429885 125 14 low low JJ 10_1101-2021_02_13_429885 125 15 - - HYPH 10_1101-2021_02_13_429885 125 16 purity purity NN 10_1101-2021_02_13_429885 125 17 solutions solution NNS 10_1101-2021_02_13_429885 125 18 , , , 10_1101-2021_02_13_429885 125 19 passing pass VBG 10_1101-2021_02_13_429885 125 20 the the DT 10_1101-2021_02_13_429885 125 21 others other NNS 10_1101-2021_02_13_429885 125 22 ; ; : 10_1101-2021_02_13_429885 125 23 the the DT 10_1101-2021_02_13_429885 125 24 little little JJ 10_1101-2021_02_13_429885 125 25 adjustment adjustment NN 10_1101-2021_02_13_429885 125 26 suggested suggest VBD 10_1101-2021_02_13_429885 125 27 to to IN 10_1101-2021_02_13_429885 125 28 the the DT 10_1101-2021_02_13_429885 125 29 default default NN 10_1101-2021_02_13_429885 125 30 parameters parameter NNS 10_1101-2021_02_13_429885 
125 31 slightly slightly RB 10_1101-2021_02_13_429885 125 32 improves improve VBZ 10_1101-2021_02_13_429885 125 33 the the DT 10_1101-2021_02_13_429885 125 34 purity purity NN 10_1101-2021_02_13_429885 125 35 , , , 10_1101-2021_02_13_429885 125 36 but but CC 10_1101-2021_02_13_429885 125 37 the the DT 10_1101-2021_02_13_429885 125 38 overall overall JJ 10_1101-2021_02_13_429885 125 39 quality quality NN 10_1101-2021_02_13_429885 125 40 is be VBZ 10_1101-2021_02_13_429885 125 41 high high JJ 10_1101-2021_02_13_429885 125 42 even even RB 10_1101-2021_02_13_429885 125 43 with with IN 10_1101-2021_02_13_429885 125 44 just just RB 10_1101-2021_02_13_429885 125 45 default default NN 10_1101-2021_02_13_429885 125 46 parameters parameter NNS 10_1101-2021_02_13_429885 125 47 ( ( -LRB- 10_1101-2021_02_13_429885 125 48 ​Figure ​figure NN 10_1101-2021_02_13_429885 125 49 4b 4b NNS 10_1101-2021_02_13_429885 125 50 ​ ​ JJ 10_1101-2021_02_13_429885 125 51 ) ) -RRB- 10_1101-2021_02_13_429885 125 52 . . . 
10_1101-2021_02_13_429885 126 1 The the DT 10_1101-2021_02_13_429885 126 2 whole whole JJ 10_1101-2021_02_13_429885 126 3 - - HYPH 10_1101-2021_02_13_429885 126 4 genome genome JJ 10_1101-2021_02_13_429885 126 5 CNA cna NN 10_1101-2021_02_13_429885 126 6 profile profile NN 10_1101-2021_02_13_429885 126 7 for for IN 10_1101-2021_02_13_429885 126 8 this this DT 10_1101-2021_02_13_429885 126 9 sample sample NN 10_1101-2021_02_13_429885 126 10 shows show VBZ 10_1101-2021_02_13_429885 126 11 some some DT 10_1101-2021_02_13_429885 126 12 degree degree NN 10_1101-2021_02_13_429885 126 13 of of IN 10_1101-2021_02_13_429885 126 14 aneuploidy aneuploidy NNP 10_1101-2021_02_13_429885 126 15 ( ( -LRB- 10_1101-2021_02_13_429885 126 16 ​Figure ​figure NN 10_1101-2021_02_13_429885 126 17 4c 4c CD 10_1101-2021_02_13_429885 126 18 ​ ​ JJ 10_1101-2021_02_13_429885 126 19 ) ) -RRB- 10_1101-2021_02_13_429885 126 20 , , , 10_1101-2021_02_13_429885 126 21 and and CC 10_1101-2021_02_13_429885 126 22 it -PRON- PRP 10_1101-2021_02_13_429885 126 23 is be VBZ 10_1101-2021_02_13_429885 126 24 easy easy JJ 10_1101-2021_02_13_429885 126 25 with with IN 10_1101-2021_02_13_429885 126 26 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 126 27 to to TO 10_1101-2021_02_13_429885 126 28 assess assess VB 10_1101-2021_02_13_429885 126 29 miscalled miscall VBN 10_1101-2021_02_13_429885 126 30 CNA CNA NNP 10_1101-2021_02_13_429885 126 31 segments segment NNS 10_1101-2021_02_13_429885 126 32 ahead ahead RB 10_1101-2021_02_13_429885 126 33 of of IN 10_1101-2021_02_13_429885 126 34 the the DT 10_1101-2021_02_13_429885 126 35 VAF VAF NNP 10_1101-2021_02_13_429885 126 36 data datum NNS 10_1101-2021_02_13_429885 126 37 ( ( -LRB- 10_1101-2021_02_13_429885 126 38 ​Figure ​figure NN 10_1101-2021_02_13_429885 126 39 4d 4d NNS 10_1101-2021_02_13_429885 126 40 ​ ​ JJ 10_1101-2021_02_13_429885 126 41 ) ) -RRB- 10_1101-2021_02_13_429885 126 42 . . . 
10_1101-2021_02_13_429885 127 1 The the DT 10_1101-2021_02_13_429885 127 2 analysis analysis NN 10_1101-2021_02_13_429885 127 3 of of IN 10_1101-2021_02_13_429885 127 4 all all PDT 10_1101-2021_02_13_429885 127 5 the the DT 10_1101-2021_02_13_429885 127 6 samples sample NNS 10_1101-2021_02_13_429885 127 7 available available JJ 10_1101-2021_02_13_429885 127 8 for for IN 10_1101-2021_02_13_429885 127 9 Set7 Set7 NNP 10_1101-2021_02_13_429885 127 10 shows show VBZ 10_1101-2021_02_13_429885 127 11 an an DT 10_1101-2021_02_13_429885 127 12 overall overall JJ 10_1101-2021_02_13_429885 127 13 CNA cna NN 10_1101-2021_02_13_429885 127 14 profile profile NN 10_1101-2021_02_13_429885 127 15 with with IN 10_1101-2021_02_13_429885 127 16 many many JJ 10_1101-2021_02_13_429885 127 17 diploid diploid JJ 10_1101-2021_02_13_429885 127 18 regions region NNS 10_1101-2021_02_13_429885 127 19 and and CC 10_1101-2021_02_13_429885 127 20 mild mild JJ 10_1101-2021_02_13_429885 127 21 aneuploidy aneuploidy NN 10_1101-2021_02_13_429885 127 22 ( ( -LRB- 10_1101-2021_02_13_429885 127 23 ​Figure ​figure NN 10_1101-2021_02_13_429885 127 24 4e 4e NN 10_1101-2021_02_13_429885 127 25 ​ ​ JJ 10_1101-2021_02_13_429885 127 26 ) ) -RRB- 10_1101-2021_02_13_429885 127 27 , , , 10_1101-2021_02_13_429885 127 28 consistent consistent JJ 10_1101-2021_02_13_429885 127 29 with with IN 10_1101-2021_02_13_429885 127 30 a a DT 10_1101-2021_02_13_429885 127 31 microsatellite microsatellite JJ 10_1101-2021_02_13_429885 127 32 stable stable JJ 10_1101-2021_02_13_429885 127 33 colorectal colorectal JJ 10_1101-2021_02_13_429885 127 34 cancer cancer NN 10_1101-2021_02_13_429885 127 35 ​(Cross ​(Cross NNP 10_1101-2021_02_13_429885 127 36 et et NNP 10_1101-2021_02_13_429885 127 37 al al NNP 10_1101-2021_02_13_429885 127 38 . . . 10_1101-2021_02_13_429885 128 1 10 10 CD 10_1101-2021_02_13_429885 128 2 2018)​. 2018)​. 
CD 10_1101-2021_02_13_429885 129 1 Large large JJ 10_1101-2021_02_13_429885 129 2 - - HYPH 10_1101-2021_02_13_429885 129 3 scale scale NN 10_1101-2021_02_13_429885 129 4 pan pan NN 10_1101-2021_02_13_429885 129 5 cancer cancer NN 10_1101-2021_02_13_429885 129 6 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 129 7 calls call VBZ 10_1101-2021_02_13_429885 129 8 We -PRON- PRP 10_1101-2021_02_13_429885 129 9 have have VBP 10_1101-2021_02_13_429885 129 10 run run VBN 10_1101-2021_02_13_429885 129 11 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 129 12 on on IN 10_1101-2021_02_13_429885 129 13 a a DT 10_1101-2021_02_13_429885 129 14 subset subset NN 10_1101-2021_02_13_429885 129 15 of of IN 10_1101-2021_02_13_429885 129 16 the the DT 10_1101-2021_02_13_429885 129 17 full full JJ 10_1101-2021_02_13_429885 129 18 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 129 19 cohort cohort NN 10_1101-2021_02_13_429885 129 20 , , , 10_1101-2021_02_13_429885 129 21 which which WDT 10_1101-2021_02_13_429885 129 22 contains contain VBZ 10_1101-2021_02_13_429885 129 23 thousands thousand NNS 10_1101-2021_02_13_429885 129 24 of of IN 10_1101-2021_02_13_429885 129 25 samples sample NNS 10_1101-2021_02_13_429885 129 26 from from IN 10_1101-2021_02_13_429885 129 27 multiple multiple JJ 10_1101-2021_02_13_429885 129 28 tumour tumour NN 10_1101-2021_02_13_429885 129 29 types type NNS 10_1101-2021_02_13_429885 129 30 ​(Campbell ​(campbell VBP 10_1101-2021_02_13_429885 129 31 et et FW 10_1101-2021_02_13_429885 129 32 al al NNP 10_1101-2021_02_13_429885 129 33 . . . 10_1101-2021_02_13_429885 130 1 2020)​. 2020)​. 
CD 10_1101-2021_02_13_429885 131 1 The the DT 10_1101-2021_02_13_429885 131 2 median median JJ 10_1101-2021_02_13_429885 131 3 coverage coverage NN 10_1101-2021_02_13_429885 131 4 of of IN 10_1101-2021_02_13_429885 131 5 this this DT 10_1101-2021_02_13_429885 131 6 cohort cohort NN 10_1101-2021_02_13_429885 131 7 is be VBZ 10_1101-2021_02_13_429885 131 8 45x 45x NNS 10_1101-2021_02_13_429885 131 9 , , , 10_1101-2021_02_13_429885 131 10 with with IN 10_1101-2021_02_13_429885 131 11 purity purity NN 10_1101-2021_02_13_429885 131 12 ~65 ~65 NFP 10_1101-2021_02_13_429885 131 13 % % NN 10_1101-2021_02_13_429885 131 14 ​(Caravagna ​(Caravagna NNP 10_1101-2021_02_13_429885 131 15 et et NNP 10_1101-2021_02_13_429885 131 16 al al NNP 10_1101-2021_02_13_429885 131 17 . . . 10_1101-2021_02_13_429885 132 1 2020)​ 2020)​ CD 10_1101-2021_02_13_429885 132 2 ; ; : 10_1101-2021_02_13_429885 132 3 a a DT 10_1101-2021_02_13_429885 132 4 much much RB 10_1101-2021_02_13_429885 132 5 lower low JJR 10_1101-2021_02_13_429885 132 6 resolution resolution NN 10_1101-2021_02_13_429885 132 7 than than IN 10_1101-2021_02_13_429885 132 8 the the DT 10_1101-2021_02_13_429885 132 9 data datum NNS 10_1101-2021_02_13_429885 132 10 available available JJ 10_1101-2021_02_13_429885 132 11 for for IN 10_1101-2021_02_13_429885 132 12 the the DT 10_1101-2021_02_13_429885 132 13 multi multi JJ 10_1101-2021_02_13_429885 132 14 - - JJ 10_1101-2021_02_13_429885 132 15 region region JJ 10_1101-2021_02_13_429885 132 16 samples sample NNS 10_1101-2021_02_13_429885 132 17 discussed discuss VBN 10_1101-2021_02_13_429885 132 18 in in IN 10_1101-2021_02_13_429885 132 19 the the DT 10_1101-2021_02_13_429885 132 20 previous previous JJ 10_1101-2021_02_13_429885 132 21 section section NN 10_1101-2021_02_13_429885 132 22 . . . 
10_1101-2021_02_13_429885 133 1 Because because IN 10_1101-2021_02_13_429885 133 2 of of IN 10_1101-2021_02_13_429885 133 3 this this DT 10_1101-2021_02_13_429885 133 4 , , , 10_1101-2021_02_13_429885 133 5 peak peak VB 10_1101-2021_02_13_429885 133 6 detection detection NN 10_1101-2021_02_13_429885 133 7 from from IN 10_1101-2021_02_13_429885 133 8 the the DT 10_1101-2021_02_13_429885 133 9 VAF VAF NNP 10_1101-2021_02_13_429885 133 10 distribution distribution NN 10_1101-2021_02_13_429885 133 11 across across IN 10_1101-2021_02_13_429885 133 12 some some DT 10_1101-2021_02_13_429885 133 13 of of IN 10_1101-2021_02_13_429885 133 14 the the DT 10_1101-2021_02_13_429885 133 15 samples sample NNS 10_1101-2021_02_13_429885 133 16 would would MD 10_1101-2021_02_13_429885 133 17 be be VB 10_1101-2021_02_13_429885 133 18 challenged challenge VBN 10_1101-2021_02_13_429885 133 19 by by IN 10_1101-2021_02_13_429885 133 20 signal signal JJ 10_1101-2021_02_13_429885 133 21 quality quality NN 10_1101-2021_02_13_429885 133 22 ; ; : 10_1101-2021_02_13_429885 133 23 in in IN 10_1101-2021_02_13_429885 133 24 practice practice NN 10_1101-2021_02_13_429885 133 25 , , , 10_1101-2021_02_13_429885 133 26 for for IN 10_1101-2021_02_13_429885 133 27 genomes genome NNS 10_1101-2021_02_13_429885 133 28 with with IN 10_1101-2021_02_13_429885 133 29 complex complex JJ 10_1101-2021_02_13_429885 133 30 aneuploidy aneuploidy NN 10_1101-2021_02_13_429885 133 31 and and CC 10_1101-2021_02_13_429885 133 32 massive massive JJ 10_1101-2021_02_13_429885 133 33 drops drop NNS 10_1101-2021_02_13_429885 133 34 in in IN 10_1101-2021_02_13_429885 133 35 purity purity NN 10_1101-2021_02_13_429885 133 36 and and CC 10_1101-2021_02_13_429885 133 37 coverage coverage VB 10_1101-2021_02_13_429885 133 38 the the DT 10_1101-2021_02_13_429885 133 39 VAF VAF NNP 10_1101-2021_02_13_429885 133 40 distribution distribution NN 10_1101-2021_02_13_429885 133 41 is be VBZ 10_1101-2021_02_13_429885 133 42 unsuitable 
unsuitable JJ 10_1101-2021_02_13_429885 133 43 for for IN 10_1101-2021_02_13_429885 133 44 peak peak NN 10_1101-2021_02_13_429885 133 45 - - HYPH 10_1101-2021_02_13_429885 133 46 detection detection NN 10_1101-2021_02_13_429885 133 47 , , , 10_1101-2021_02_13_429885 133 48 leading lead VBG 10_1101-2021_02_13_429885 133 49 to to IN 10_1101-2021_02_13_429885 133 50 false false JJ 10_1101-2021_02_13_429885 133 51 - - HYPH 10_1101-2021_02_13_429885 133 52 positives positive NNS 10_1101-2021_02_13_429885 133 53 in in IN 10_1101-2021_02_13_429885 133 54 the the DT 10_1101-2021_02_13_429885 133 55 QC QC NNP 10_1101-2021_02_13_429885 133 56 process process NN 10_1101-2021_02_13_429885 133 57 . . . 10_1101-2021_02_13_429885 134 1 To to TO 10_1101-2021_02_13_429885 134 2 avoid avoid VB 10_1101-2021_02_13_429885 134 3 this this DT 10_1101-2021_02_13_429885 134 4 and and CC 10_1101-2021_02_13_429885 134 5 work work VB 10_1101-2021_02_13_429885 134 6 with with IN 10_1101-2021_02_13_429885 134 7 suitable suitable JJ 10_1101-2021_02_13_429885 134 8 samples sample NNS 10_1101-2021_02_13_429885 134 9 , , , 10_1101-2021_02_13_429885 134 10 we -PRON- PRP 10_1101-2021_02_13_429885 134 11 identified identify VBD 10_1101-2021_02_13_429885 134 12 cases case NNS 10_1101-2021_02_13_429885 134 13 adopting adopt VBG 10_1101-2021_02_13_429885 134 14 the the DT 10_1101-2021_02_13_429885 134 15 following following JJ 10_1101-2021_02_13_429885 134 16 conditions condition NNS 10_1101-2021_02_13_429885 134 17 : : : 10_1101-2021_02_13_429885 134 18 ( ( -LRB- 10_1101-2021_02_13_429885 134 19 i i NN 10_1101-2021_02_13_429885 134 20 ) ) -RRB- 10_1101-2021_02_13_429885 134 21 the the DT 10_1101-2021_02_13_429885 134 22 065n 065n NNPS 10_1101-2021_02_13_429885 134 23 = = SYM 10_1101-2021_02_13_429885 134 24 1 1 CD 10_1101-2021_02_13_429885 134 25 tumour tumour NN 10_1101-2021_02_13_429885 134 26 type type NN 10_1101-2021_02_13_429885 134 27 contains contain VBZ 10_1101-2021_02_13_429885 134 28 > > NFP 
10_1101-2021_02_13_429885 134 29 20 20 CD 10_1101-2021_02_13_429885 134 30 samples sample NNS 10_1101-2021_02_13_429885 134 31 , , , 10_1101-2021_02_13_429885 134 32 ( ( -LRB- 10_1101-2021_02_13_429885 134 33 ii ii LS 10_1101-2021_02_13_429885 134 34 ) ) -RRB- 10_1101-2021_02_13_429885 134 35 the the DT 10_1101-2021_02_13_429885 134 36 tumour tumour NN 10_1101-2021_02_13_429885 134 37 genome genome NN 10_1101-2021_02_13_429885 134 38 used use VBN 10_1101-2021_02_13_429885 134 39 for for IN 10_1101-2021_02_13_429885 134 40 QC QC NNP 10_1101-2021_02_13_429885 134 41 contains contain VBZ 10_1101-2021_02_13_429885 134 42 > > XX 10_1101-2021_02_13_429885 134 43 30 30 CD 10_1101-2021_02_13_429885 134 44 % % NN 10_1101-2021_02_13_429885 134 45 of of IN 10_1101-2021_02_13_429885 134 46 the the DT 10_1101-2021_02_13_429885 134 47 overall overall JJ 10_1101-2021_02_13_429885 134 48 SNVs SNVs NNPS 10_1101-2021_02_13_429885 134 49 in in IN 10_1101-2021_02_13_429885 134 50 the the DT 10_1101-2021_02_13_429885 134 51 tumour tumour NN 10_1101-2021_02_13_429885 134 52 - - , 10_1101-2021_02_13_429885 134 53 so so IN 10_1101-2021_02_13_429885 134 54 a a DT 10_1101-2021_02_13_429885 134 55 substantial substantial JJ 10_1101-2021_02_13_429885 134 56 part part NN 10_1101-2021_02_13_429885 134 57 of of IN 10_1101-2021_02_13_429885 134 58 the the DT 10_1101-2021_02_13_429885 134 59 overall overall JJ 10_1101-2021_02_13_429885 134 60 mutational mutational JJ 10_1101-2021_02_13_429885 134 61 burden burden NN 10_1101-2021_02_13_429885 134 62 - - : 10_1101-2021_02_13_429885 134 63 and and CC 10_1101-2021_02_13_429885 134 64 ( ( -LRB- 10_1101-2021_02_13_429885 134 65 iii iii NN 10_1101-2021_02_13_429885 134 66 ) ) -RRB- 10_1101-2021_02_13_429885 134 67 the the DT 10_1101-2021_02_13_429885 134 68 purity purity NN 10_1101-2021_02_13_429885 134 69 of of IN 10_1101-2021_02_13_429885 134 70 the the DT 10_1101-2021_02_13_429885 134 71 sample sample NN 10_1101-2021_02_13_429885 134 72 is be VBZ 
10_1101-2021_02_13_429885 134 73 > > XX 10_1101-2021_02_13_429885 134 74 60 60 CD 10_1101-2021_02_13_429885 134 75 % % NN 10_1101-2021_02_13_429885 134 76 - - , 10_1101-2021_02_13_429885 134 77 so so IN 10_1101-2021_02_13_429885 134 78 the the DT 10_1101-2021_02_13_429885 134 79 signal signal NN 10_1101-2021_02_13_429885 134 80 is be VBZ 10_1101-2021_02_13_429885 134 81 suitable suitable JJ 10_1101-2021_02_13_429885 134 82 for for IN 10_1101-2021_02_13_429885 134 83 peak peak NN 10_1101-2021_02_13_429885 134 84 detection detection NN 10_1101-2021_02_13_429885 134 85 . . . 10_1101-2021_02_13_429885 135 1 On on IN 10_1101-2021_02_13_429885 135 2 a a DT 10_1101-2021_02_13_429885 135 3 standard standard JJ 10_1101-2021_02_13_429885 135 4 cluster cluster NN 10_1101-2021_02_13_429885 135 5 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 135 6 ran run VBD 10_1101-2021_02_13_429885 135 7 in in IN 10_1101-2021_02_13_429885 135 8 less less JJR 10_1101-2021_02_13_429885 135 9 than than IN 10_1101-2021_02_13_429885 135 10 1 1 CD 10_1101-2021_02_13_429885 135 11 hour hour NN 10_1101-2021_02_13_429885 135 12 for for IN 10_1101-2021_02_13_429885 135 13 these these DT 10_1101-2021_02_13_429885 135 14 samples sample NNS 10_1101-2021_02_13_429885 135 15 ; ; : 10_1101-2021_02_13_429885 135 16 notably notably RB 10_1101-2021_02_13_429885 135 17 the the DT 10_1101-2021_02_13_429885 135 18 1 1 CD 10_1101-2021_02_13_429885 135 19 Technically technically RB 10_1101-2021_02_13_429885 135 20 the the DT 10_1101-2021_02_13_429885 135 21 default default JJ 10_1101-2021_02_13_429885 135 22 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 135 23 values value NNS 10_1101-2021_02_13_429885 135 24 for for IN 10_1101-2021_02_13_429885 135 25 ploidy ploidy NN 10_1101-2021_02_13_429885 135 26 reach reach NN 10_1101-2021_02_13_429885 135 27 maximum maximum NN 10_1101-2021_02_13_429885 135 28 at at IN 10_1101-2021_02_13_429885 135 29 7 7 CD 10_1101-2021_02_13_429885 135 30 ; ; : 10_1101-2021_02_13_429885 135 
31 being be VBG 10_1101-2021_02_13_429885 135 32 unrealistic unrealistic JJ 10_1101-2021_02_13_429885 135 33 for for IN 10_1101-2021_02_13_429885 135 34 our -PRON- PRP$ 10_1101-2021_02_13_429885 135 35 cases case NNS 10_1101-2021_02_13_429885 135 36 we -PRON- PRP 10_1101-2021_02_13_429885 135 37 limited limit VBD 10_1101-2021_02_13_429885 135 38 the the DT 10_1101-2021_02_13_429885 135 39 maximum maximum JJ 10_1101-2021_02_13_429885 135 40 ploidy ploidy NN 10_1101-2021_02_13_429885 135 41 to to TO 10_1101-2021_02_13_429885 135 42 be be VB 10_1101-2021_02_13_429885 135 43 5 5 CD 10_1101-2021_02_13_429885 135 44 . . . 
10_1101-2021_02_13_429885 139 1 completion completion NN 10_1101-2021_02_13_429885 139 2 time time NN 10_1101-2021_02_13_429885 139 3 ( ( -LRB- 10_1101-2021_02_13_429885 139 4 per per FW 10_1101-2021_02_13_429885 139 5 sample sample NN 10_1101-2021_02_13_429885 139 6 ) ) -RRB- 10_1101-2021_02_13_429885 139 7 on on IN 10_1101-2021_02_13_429885 139 8 a a DT 10_1101-2021_02_13_429885 139 9 laptop laptop NN 10_1101-2021_02_13_429885 139 10 is be VBZ 10_1101-2021_02_13_429885 139 11 less less JJR 10_1101-2021_02_13_429885 139 12 than than IN 10_1101-2021_02_13_429885 139 13 1 1 CD 10_1101-2021_02_13_429885 139 14 minute minute NN 10_1101-2021_02_13_429885 139 15 , , , 10_1101-2021_02_13_429885 139 16 meaning mean VBG 10_1101-2021_02_13_429885 139 17 that that IN 10_1101-2021_02_13_429885 139 18 preliminary preliminary JJ 10_1101-2021_02_13_429885 139 19 analysis analysis NN 10_1101-2021_02_13_429885 139 20 can can MD 10_1101-2021_02_13_429885 139 21 be be VB 10_1101-2021_02_13_429885 139 22 
carried carry VBN 10_1101-2021_02_13_429885 139 23 out out RP 10_1101-2021_02_13_429885 139 24 very very RB 10_1101-2021_02_13_429885 139 25 quickly quickly RB 10_1101-2021_02_13_429885 139 26 and and CC 10_1101-2021_02_13_429885 139 27 without without IN 10_1101-2021_02_13_429885 139 28 large large JJ 10_1101-2021_02_13_429885 139 29 computing computing NN 10_1101-2021_02_13_429885 139 30 infrastructures infrastructure NNS 10_1101-2021_02_13_429885 139 31 . . . 10_1101-2021_02_13_429885 140 1 The the DT 10_1101-2021_02_13_429885 140 2 calls call NNS 10_1101-2021_02_13_429885 140 3 in in IN 10_1101-2021_02_13_429885 140 4 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 140 5 were be VBD 10_1101-2021_02_13_429885 140 6 obtained obtain VBN 10_1101-2021_02_13_429885 140 7 by by IN 10_1101-2021_02_13_429885 140 8 consensus consensus NN 10_1101-2021_02_13_429885 140 9 with with IN 10_1101-2021_02_13_429885 140 10 multiple multiple JJ 10_1101-2021_02_13_429885 140 11 bioinformatics bioinformatic NNS 10_1101-2021_02_13_429885 140 12 tools tool NNS 10_1101-2021_02_13_429885 140 13 , , , 10_1101-2021_02_13_429885 140 14 and and CC 10_1101-2021_02_13_429885 140 15 for for IN 10_1101-2021_02_13_429885 140 16 this this DT 10_1101-2021_02_13_429885 140 17 reason reason NN 10_1101-2021_02_13_429885 140 18 we -PRON- PRP 10_1101-2021_02_13_429885 140 19 expected expect VBD 10_1101-2021_02_13_429885 140 20 them -PRON- PRP 10_1101-2021_02_13_429885 140 21 to to TO 10_1101-2021_02_13_429885 140 22 be be VB 10_1101-2021_02_13_429885 140 23 reliable reliable JJ 10_1101-2021_02_13_429885 140 24 . . . 
10_1101-2021_02_13_429885 141 1 Manual manual JJ 10_1101-2021_02_13_429885 141 2 inspections inspection NNS 10_1101-2021_02_13_429885 141 3 of of IN 10_1101-2021_02_13_429885 141 4 some some DT 10_1101-2021_02_13_429885 141 5 patient patient JJ 10_1101-2021_02_13_429885 141 6 data datum NNS 10_1101-2021_02_13_429885 141 7 showed show VBD 10_1101-2021_02_13_429885 141 8 indeed indeed RB 10_1101-2021_02_13_429885 141 9 many many JJ 10_1101-2021_02_13_429885 141 10 high high JJ 10_1101-2021_02_13_429885 141 11 - - HYPH 10_1101-2021_02_13_429885 141 12 quality quality NN 10_1101-2021_02_13_429885 141 13 calls call NNS 10_1101-2021_02_13_429885 141 14 , , , 10_1101-2021_02_13_429885 141 15 but but CC 10_1101-2021_02_13_429885 141 16 also also RB 10_1101-2021_02_13_429885 141 17 highlighted highlight VBD 10_1101-2021_02_13_429885 141 18 a a DT 10_1101-2021_02_13_429885 141 19 variety variety NN 10_1101-2021_02_13_429885 141 20 of of IN 10_1101-2021_02_13_429885 141 21 interesting interesting JJ 10_1101-2021_02_13_429885 141 22 cases case NNS 10_1101-2021_02_13_429885 141 23 . . . 
10_1101-2021_02_13_429885 142 1 For for IN 10_1101-2021_02_13_429885 142 2 instance instance NN 10_1101-2021_02_13_429885 142 3 , , , 10_1101-2021_02_13_429885 142 4 tumours tumour NNS 10_1101-2021_02_13_429885 142 5 with with IN 10_1101-2021_02_13_429885 142 6 extremely extremely RB 10_1101-2021_02_13_429885 142 7 low low JJ 10_1101-2021_02_13_429885 142 8 mutational mutational JJ 10_1101-2021_02_13_429885 142 9 burden burden NN 10_1101-2021_02_13_429885 142 10 but but CC 10_1101-2021_02_13_429885 142 11 high high JJ 10_1101-2021_02_13_429885 142 12 quality quality NN 10_1101-2021_02_13_429885 142 13 calls call NNS 10_1101-2021_02_13_429885 142 14 still still RB 10_1101-2021_02_13_429885 142 15 yielded yield VBD 10_1101-2021_02_13_429885 142 16 a a DT 10_1101-2021_02_13_429885 142 17 useful useful JJ 10_1101-2021_02_13_429885 142 18 report report NN 10_1101-2021_02_13_429885 142 19 , , , 10_1101-2021_02_13_429885 142 20 suggesting suggest VBG 10_1101-2021_02_13_429885 142 21 that that IN 10_1101-2021_02_13_429885 142 22 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 142 23 can can MD 10_1101-2021_02_13_429885 142 24 work work VB 10_1101-2021_02_13_429885 142 25 also also RB 10_1101-2021_02_13_429885 142 26 with with IN 10_1101-2021_02_13_429885 142 27 mutational mutational JJ 10_1101-2021_02_13_429885 142 28 burden burden NN 10_1101-2021_02_13_429885 142 29 from from IN 10_1101-2021_02_13_429885 142 30 whole whole JJ 10_1101-2021_02_13_429885 142 31 - - HYPH 10_1101-2021_02_13_429885 142 32 exome exome NN 10_1101-2021_02_13_429885 142 33 sequencing sequencing NN 10_1101-2021_02_13_429885 142 34 ( ( -LRB- 10_1101-2021_02_13_429885 142 35 ​Supplementary ​supplementary JJ 10_1101-2021_02_13_429885 142 36 Figure figure NN 10_1101-2021_02_13_429885 142 37 S1 s1 NN 10_1101-2021_02_13_429885 142 38 ​ ​ JJ 10_1101-2021_02_13_429885 142 39 ) ) -RRB- 10_1101-2021_02_13_429885 142 40 . . . 
10_1101-2021_02_13_429885 143 1 For for IN 10_1101-2021_02_13_429885 143 2 other other JJ 10_1101-2021_02_13_429885 143 3 tumours tumour NNS 10_1101-2021_02_13_429885 143 4 , , , 10_1101-2021_02_13_429885 143 5 we -PRON- PRP 10_1101-2021_02_13_429885 143 6 found find VBD 10_1101-2021_02_13_429885 143 7 high high JJ 10_1101-2021_02_13_429885 143 8 purity purity NN 10_1101-2021_02_13_429885 143 9 levels level NNS 10_1101-2021_02_13_429885 143 10 > > XX 10_1101-2021_02_13_429885 143 11 90 90 CD 10_1101-2021_02_13_429885 143 12 % % NN 10_1101-2021_02_13_429885 143 13 , , , 10_1101-2021_02_13_429885 143 14 which which WDT 10_1101-2021_02_13_429885 143 15 are be VBP 10_1101-2021_02_13_429885 143 16 probably probably RB 10_1101-2021_02_13_429885 143 17 overestimated overestimate VBN 10_1101-2021_02_13_429885 143 18 ( ( -LRB- 10_1101-2021_02_13_429885 143 19 ​Supplementary ​supplementary JJ 10_1101-2021_02_13_429885 143 20 Figure figure NN 10_1101-2021_02_13_429885 143 21 S11 S11 NNP 10_1101-2021_02_13_429885 143 22 ​ ​ NNP 10_1101-2021_02_13_429885 143 23 ) ) -RRB- 10_1101-2021_02_13_429885 143 24 compared compare VBN 10_1101-2021_02_13_429885 143 25 to to IN 10_1101-2021_02_13_429885 143 26 others other NNS 10_1101-2021_02_13_429885 143 27 where where WRB 10_1101-2021_02_13_429885 143 28 purity purity NN 10_1101-2021_02_13_429885 143 29 is be VBZ 10_1101-2021_02_13_429885 143 30 genuinely genuinely RB 10_1101-2021_02_13_429885 143 31 very very RB 10_1101-2021_02_13_429885 143 32 high high JJ 10_1101-2021_02_13_429885 143 33 ( ( -LRB- 10_1101-2021_02_13_429885 143 34 ​Supplementary ​supplementary JJ 10_1101-2021_02_13_429885 143 35 Figure figure NN 10_1101-2021_02_13_429885 143 36 S12 S12 NNP 10_1101-2021_02_13_429885 143 37 ​ ​ NNP 10_1101-2021_02_13_429885 143 38 ) ) -RRB- 10_1101-2021_02_13_429885 143 39 . . . 
10_1101-2021_02_13_429885 144 1 Overall overall RB 10_1101-2021_02_13_429885 144 2 , , , 10_1101-2021_02_13_429885 144 3 the the DT 10_1101-2021_02_13_429885 144 4 scores score NNS 10_1101-2021_02_13_429885 144 5 from from IN 10_1101-2021_02_13_429885 144 6 peak peak NN 10_1101-2021_02_13_429885 144 7 detection detection NN 10_1101-2021_02_13_429885 144 8 are be VBP 10_1101-2021_02_13_429885 144 9 reliable reliable JJ 10_1101-2021_02_13_429885 144 10 for for IN 10_1101-2021_02_13_429885 144 11 the the DT 10_1101-2021_02_13_429885 144 12 majority majority NN 10_1101-2021_02_13_429885 144 13 of of IN 10_1101-2021_02_13_429885 144 14 the the DT 10_1101-2021_02_13_429885 144 15 analysed analyse VBN 10_1101-2021_02_13_429885 144 16 samples sample NNS 10_1101-2021_02_13_429885 144 17 ( ( -LRB- 10_1101-2021_02_13_429885 144 18 ​Figure ​figure NN 10_1101-2021_02_13_429885 144 19 5a 5a CD 10_1101-2021_02_13_429885 144 20 ​ ​ JJ 10_1101-2021_02_13_429885 144 21 ) ) -RRB- 10_1101-2021_02_13_429885 144 22 - - : 10_1101-2021_02_13_429885 144 23 the the DT 10_1101-2021_02_13_429885 144 24 diploid diploid NNP 10_1101-2021_02_13_429885 144 25 85 85 CD 10_1101-2021_02_13_429885 144 26 % % NN 10_1101-2021_02_13_429885 144 27 purity purity NN 10_1101-2021_02_13_429885 144 28 tumour tumour NN 10_1101-2021_02_13_429885 144 29 in in IN 10_1101-2021_02_13_429885 144 30 ​Figures ​figure NNS 10_1101-2021_02_13_429885 144 31 2 2 CD 10_1101-2021_02_13_429885 144 32 ​and ​and CC 10_1101-2021_02_13_429885 144 33 3 3 CD 10_1101-2021_02_13_429885 144 34 is be VBZ 10_1101-2021_02_13_429885 144 35 taken take VBN 10_1101-2021_02_13_429885 144 36 from from IN 10_1101-2021_02_13_429885 144 37 this this DT 10_1101-2021_02_13_429885 144 38 list list NN 10_1101-2021_02_13_429885 144 39 - - : 10_1101-2021_02_13_429885 144 40 with with IN 10_1101-2021_02_13_429885 144 41 only only RB 10_1101-2021_02_13_429885 144 42 a a DT 10_1101-2021_02_13_429885 144 43 few few JJ 10_1101-2021_02_13_429885 144 44 cases 
case NNS 10_1101-2021_02_13_429885 144 45 requiring require VBG 10_1101-2021_02_13_429885 144 46 further further JJ 10_1101-2021_02_13_429885 144 47 checks check NNS 10_1101-2021_02_13_429885 144 48 ( ( -LRB- 10_1101-2021_02_13_429885 144 49 ​Figure ​figure NN 10_1101-2021_02_13_429885 144 50 5b 5b NN 10_1101-2021_02_13_429885 144 51 ​ ​ JJ 10_1101-2021_02_13_429885 144 52 ) ) -RRB- 10_1101-2021_02_13_429885 144 53 . . . 10_1101-2021_02_13_429885 145 1 The the DT 10_1101-2021_02_13_429885 145 2 peak peak NN 10_1101-2021_02_13_429885 145 3 detection detection NN 10_1101-2021_02_13_429885 145 4 by by IN 10_1101-2021_02_13_429885 145 5 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 145 6 therefore therefore RB 10_1101-2021_02_13_429885 145 7 confirms confirm VBZ 10_1101-2021_02_13_429885 145 8 the the DT 10_1101-2021_02_13_429885 145 9 calls call NNS 10_1101-2021_02_13_429885 145 10 reliability reliability NN 10_1101-2021_02_13_429885 145 11 in in IN 10_1101-2021_02_13_429885 145 12 terms term NNS 10_1101-2021_02_13_429885 145 13 of of IN 10_1101-2021_02_13_429885 145 14 breakpoints breakpoint NNS 10_1101-2021_02_13_429885 145 15 , , , 10_1101-2021_02_13_429885 145 16 segments segment VBZ 10_1101-2021_02_13_429885 145 17 ploidy ploidy NN 10_1101-2021_02_13_429885 145 18 and and CC 10_1101-2021_02_13_429885 145 19 tumour tumour NN 10_1101-2021_02_13_429885 145 20 purity purity NN 10_1101-2021_02_13_429885 145 21 . . . 
10_1101-2021_02_13_429885 146 1 CCF ccf NN 10_1101-2021_02_13_429885 146 2 computations computation NNS 10_1101-2021_02_13_429885 146 3 showed show VBD 10_1101-2021_02_13_429885 146 4 a a DT 10_1101-2021_02_13_429885 146 5 higher high JJR 10_1101-2021_02_13_429885 146 6 rate rate NN 10_1101-2021_02_13_429885 146 7 of of IN 10_1101-2021_02_13_429885 146 8 failures failure NNS 10_1101-2021_02_13_429885 146 9 with with IN 10_1101-2021_02_13_429885 146 10 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 146 11 analysis analysis NN 10_1101-2021_02_13_429885 146 12 ( ( -LRB- 10_1101-2021_02_13_429885 146 13 ​Figure ​figure NN 10_1101-2021_02_13_429885 146 14 5a 5a CD 10_1101-2021_02_13_429885 146 15 ​ ​ JJ 10_1101-2021_02_13_429885 146 16 ) ) -RRB- 10_1101-2021_02_13_429885 146 17 . . . 10_1101-2021_02_13_429885 147 1 This this DT 10_1101-2021_02_13_429885 147 2 is be VBZ 10_1101-2021_02_13_429885 147 3 inevitably inevitably RB 10_1101-2021_02_13_429885 147 4 due due JJ 10_1101-2021_02_13_429885 147 5 to to IN 10_1101-2021_02_13_429885 147 6 the the DT 10_1101-2021_02_13_429885 147 7 lack lack NN 10_1101-2021_02_13_429885 147 8 of of IN 10_1101-2021_02_13_429885 147 9 signal signal JJ 10_1101-2021_02_13_429885 147 10 separability separability NN 10_1101-2021_02_13_429885 147 11 stemming stem VBG 10_1101-2021_02_13_429885 147 12 from from IN 10_1101-2021_02_13_429885 147 13 low low JJ 10_1101-2021_02_13_429885 147 14 coverage coverage NN 10_1101-2021_02_13_429885 147 15 of of IN 10_1101-2021_02_13_429885 147 16 these these DT 10_1101-2021_02_13_429885 147 17 samples sample NNS 10_1101-2021_02_13_429885 147 18 , , , 10_1101-2021_02_13_429885 147 19 even even RB 10_1101-2021_02_13_429885 147 20 for for IN 10_1101-2021_02_13_429885 147 21 high high JJ 10_1101-2021_02_13_429885 147 22 - - HYPH 10_1101-2021_02_13_429885 147 23 quality quality NN 10_1101-2021_02_13_429885 147 24 genomes genome NNS 10_1101-2021_02_13_429885 147 25 . . . 
10_1101-2021_02_13_429885 148 1 Therefore therefore RB 10_1101-2021_02_13_429885 148 2 while while IN 10_1101-2021_02_13_429885 148 3 peaks peak NNS 10_1101-2021_02_13_429885 148 4 could could MD 10_1101-2021_02_13_429885 148 5 be be VB 10_1101-2021_02_13_429885 148 6 determined determine VBN 10_1101-2021_02_13_429885 148 7 for for IN 10_1101-2021_02_13_429885 148 8 these these DT 10_1101-2021_02_13_429885 148 9 data datum NNS 10_1101-2021_02_13_429885 148 10 , , , 10_1101-2021_02_13_429885 148 11 mutation mutation NN 10_1101-2021_02_13_429885 148 12 multiplicity multiplicity NN 10_1101-2021_02_13_429885 148 13 assessment assessment NN 10_1101-2021_02_13_429885 148 14 would would MD 10_1101-2021_02_13_429885 148 15 have have VB 10_1101-2021_02_13_429885 148 16 required require VBN 10_1101-2021_02_13_429885 148 17 higher high JJR 10_1101-2021_02_13_429885 148 18 coverage coverage NN 10_1101-2021_02_13_429885 148 19 than than IN 10_1101-2021_02_13_429885 148 20 what what WP 10_1101-2021_02_13_429885 148 21 was be VBD 10_1101-2021_02_13_429885 148 22 found find VBN 10_1101-2021_02_13_429885 148 23 available available JJ 10_1101-2021_02_13_429885 148 24 . . . 
10_1101-2021_02_13_429885 149 1 In in IN 10_1101-2021_02_13_429885 149 2 summary summary NN 10_1101-2021_02_13_429885 149 3 , , , 10_1101-2021_02_13_429885 149 4 from from IN 10_1101-2021_02_13_429885 149 5 these these DT 10_1101-2021_02_13_429885 149 6 analyses analysis NNS 10_1101-2021_02_13_429885 149 7 we -PRON- PRP 10_1101-2021_02_13_429885 149 8 revealed reveal VBD 10_1101-2021_02_13_429885 149 9 that that IN 10_1101-2021_02_13_429885 149 10 the the DT 10_1101-2021_02_13_429885 149 11 problem problem NN 10_1101-2021_02_13_429885 149 12 of of IN 10_1101-2021_02_13_429885 149 13 validating validate VBG 10_1101-2021_02_13_429885 149 14 CNA CNA NNP 10_1101-2021_02_13_429885 149 15 calls call NNS 10_1101-2021_02_13_429885 149 16 , , , 10_1101-2021_02_13_429885 149 17 compared compare VBN 10_1101-2021_02_13_429885 149 18 to to IN 10_1101-2021_02_13_429885 149 19 determining determine VBG 10_1101-2021_02_13_429885 149 20 CCF ccf NN 10_1101-2021_02_13_429885 149 21 estimates estimate NNS 10_1101-2021_02_13_429885 149 22 , , , 10_1101-2021_02_13_429885 149 23 can can MD 10_1101-2021_02_13_429885 149 24 be be VB 10_1101-2021_02_13_429885 149 25 approached approach VBN 10_1101-2021_02_13_429885 149 26 with with IN 10_1101-2021_02_13_429885 149 27 lower low JJR 10_1101-2021_02_13_429885 149 28 coverage coverage NN 10_1101-2021_02_13_429885 149 29 and and CC 10_1101-2021_02_13_429885 149 30 purity purity NN 10_1101-2021_02_13_429885 149 31 values value NNS 10_1101-2021_02_13_429885 149 32 using use VBG 10_1101-2021_02_13_429885 149 33 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 149 34 . . . 
10_1101-2021_02_13_429885 150 1 Discussion discussion NN 10_1101-2021_02_13_429885 150 2 WGS WGS NNP 10_1101-2021_02_13_429885 150 3 is be VBZ 10_1101-2021_02_13_429885 150 4 a a DT 10_1101-2021_02_13_429885 150 5 powerful powerful JJ 10_1101-2021_02_13_429885 150 6 approach approach NN 10_1101-2021_02_13_429885 150 7 to to TO 10_1101-2021_02_13_429885 150 8 detect detect VB 10_1101-2021_02_13_429885 150 9 extensive extensive JJ 10_1101-2021_02_13_429885 150 10 mutations mutation NNS 10_1101-2021_02_13_429885 150 11 that that WDT 10_1101-2021_02_13_429885 150 12 drive drive VBP 10_1101-2021_02_13_429885 150 13 human human JJ 10_1101-2021_02_13_429885 150 14 cancers cancer NNS 10_1101-2021_02_13_429885 150 15 . . . 10_1101-2021_02_13_429885 151 1 Many many JJ 10_1101-2021_02_13_429885 151 2 large large JJ 10_1101-2021_02_13_429885 151 3 - - HYPH 10_1101-2021_02_13_429885 151 4 scale scale NN 10_1101-2021_02_13_429885 151 5 initiatives initiative NNS 10_1101-2021_02_13_429885 151 6 such such JJ 10_1101-2021_02_13_429885 151 7 as as IN 10_1101-2021_02_13_429885 151 8 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 151 9 ​(Campbell ​(Campbell NNP 10_1101-2021_02_13_429885 151 10 et et FW 10_1101-2021_02_13_429885 151 11 al al NNP 10_1101-2021_02_13_429885 151 12 . . . 10_1101-2021_02_13_429885 152 1 2020)​ 2020)​ CD 10_1101-2021_02_13_429885 152 2 , , , 10_1101-2021_02_13_429885 152 3 the the DT 10_1101-2021_02_13_429885 152 4 Hartwig Hartwig NNP 10_1101-2021_02_13_429885 152 5 Medical Medical NNP 10_1101-2021_02_13_429885 152 6 Foundation Foundation NNP 10_1101-2021_02_13_429885 152 7 ​(Priestley ​(Priestley NNP 10_1101-2021_02_13_429885 152 8 et et NNP 10_1101-2021_02_13_429885 152 9 al al NNP 10_1101-2021_02_13_429885 152 10 . . . 
10_1101-2021_02_13_429885 153 1 2019 2019 CD 10_1101-2021_02_13_429885 153 2 ) ) -RRB- 10_1101-2021_02_13_429885 153 3 and and CC 10_1101-2021_02_13_429885 153 4 Genomics Genomics NNP 10_1101-2021_02_13_429885 153 5 England England NNP 10_1101-2021_02_13_429885 153 6 ​(Turnbull ​(Turnbull NNP 10_1101-2021_02_13_429885 153 7 et et NNP 10_1101-2021_02_13_429885 153 8 al al NNP 10_1101-2021_02_13_429885 153 9 . . . 10_1101-2021_02_13_429885 154 1 2018 2018 CD 10_1101-2021_02_13_429885 154 2 ) ) -RRB- 10_1101-2021_02_13_429885 154 3 have have VBP 10_1101-2021_02_13_429885 154 4 already already RB 10_1101-2021_02_13_429885 154 5 generated generate VBN 10_1101-2021_02_13_429885 154 6 WGS WGS NNP 10_1101-2021_02_13_429885 154 7 data datum NNS 10_1101-2021_02_13_429885 154 8 for for IN 10_1101-2021_02_13_429885 154 9 thousands thousand NNS 10_1101-2021_02_13_429885 154 10 of of IN 10_1101-2021_02_13_429885 154 11 cancer cancer NN 10_1101-2021_02_13_429885 154 12 patients patient NNS 10_1101-2021_02_13_429885 154 13 , , , 10_1101-2021_02_13_429885 154 14 with with IN 10_1101-2021_02_13_429885 154 15 many many JJ 10_1101-2021_02_13_429885 154 16 cancer cancer NN 10_1101-2021_02_13_429885 154 17 institutes institute NNS 10_1101-2021_02_13_429885 154 18 converging converge VBG 10_1101-2021_02_13_429885 154 19 towards towards IN 10_1101-2021_02_13_429885 154 20 these these DT 10_1101-2021_02_13_429885 154 21 efforts effort NNS 10_1101-2021_02_13_429885 154 22 . . . 
10_1101-2021_02_13_429885 155 1 Calling call VBG 10_1101-2021_02_13_429885 155 2 mutations mutation NNS 10_1101-2021_02_13_429885 155 3 from from IN 10_1101-2021_02_13_429885 155 4 WGS WGS NNP 10_1101-2021_02_13_429885 155 5 data datum NNS 10_1101-2021_02_13_429885 155 6 requires require VBZ 10_1101-2021_02_13_429885 155 7 complex complex JJ 10_1101-2021_02_13_429885 155 8 bioinformatics bioinformatic NNS 10_1101-2021_02_13_429885 155 9 pipelines pipeline NNS 10_1101-2021_02_13_429885 155 10 ​(Barnell ​(Barnell NNP 10_1101-2021_02_13_429885 155 11 et et FW 10_1101-2021_02_13_429885 155 12 al al NNP 10_1101-2021_02_13_429885 155 13 . . . 10_1101-2021_02_13_429885 156 1 2019 2019 CD 10_1101-2021_02_13_429885 156 2 ; ; : 10_1101-2021_02_13_429885 156 3 Cmero Cmero NNP 10_1101-2021_02_13_429885 156 4 et et NNP 10_1101-2021_02_13_429885 156 5 al al NNP 10_1101-2021_02_13_429885 156 6 . . . 10_1101-2021_02_13_429885 157 1 2020 2020 CD 10_1101-2021_02_13_429885 157 2 ; ; : 10_1101-2021_02_13_429885 157 3 Li Li NNP 10_1101-2021_02_13_429885 157 4 et et NNP 10_1101-2021_02_13_429885 157 5 al al NNP 10_1101-2021_02_13_429885 157 6 . . . 
10_1101-2021_02_13_429885 158 1 2020 2020 CD 10_1101-2021_02_13_429885 158 2 ) ) -RRB- 10_1101-2021_02_13_429885 158 3 and and CC 10_1101-2021_02_13_429885 158 4 any any DT 10_1101-2021_02_13_429885 158 5 downstream downstream JJ 10_1101-2021_02_13_429885 158 6 analysis analysis NN 10_1101-2021_02_13_429885 158 7 relies rely VBZ 10_1101-2021_02_13_429885 158 8 upon upon IN 10_1101-2021_02_13_429885 158 9 these these DT 10_1101-2021_02_13_429885 158 10 calls call NNS 10_1101-2021_02_13_429885 158 11 , , , 10_1101-2021_02_13_429885 158 12 putting put VBG 10_1101-2021_02_13_429885 158 13 the the DT 10_1101-2021_02_13_429885 158 14 quality quality NN 10_1101-2021_02_13_429885 158 15 of of IN 10_1101-2021_02_13_429885 158 16 the the DT 10_1101-2021_02_13_429885 158 17 generated generate VBN 10_1101-2021_02_13_429885 158 18 data datum NNS 10_1101-2021_02_13_429885 158 19 under under IN 10_1101-2021_02_13_429885 158 20 the the DT 10_1101-2021_02_13_429885 158 21 spotlight spotlight NN 10_1101-2021_02_13_429885 158 22 . . . 
10_1101-2021_02_13_429885 162 1 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 162 2 offers offer VBZ 10_1101-2021_02_13_429885 162 3 the the DT 10_1101-2021_02_13_429885 162 4 first first JJ 10_1101-2021_02_13_429885 162 5 principle principle JJ 10_1101-2021_02_13_429885 162 6 framework framework NN 10_1101-2021_02_13_429885 162 7 to to TO 10_1101-2021_02_13_429885 162 8 control control VB 10_1101-2021_02_13_429885 162 9 the the DT 10_1101-2021_02_13_429885 162 10 quality quality NN 10_1101-2021_02_13_429885 162 11 of of IN 10_1101-2021_02_13_429885 162 12 tumour tumour NN 10_1101-2021_02_13_429885 162 13 mutation mutation NN 10_1101-2021_02_13_429885 162 14 calls call NNS 10_1101-2021_02_13_429885 162 15 . . .
10_1101-2021_02_13_429885 163 1 The the DT 10_1101-2021_02_13_429885 163 2 tool tool NN 10_1101-2021_02_13_429885 163 3 can can MD 10_1101-2021_02_13_429885 163 4 analyse analyse VB 10_1101-2021_02_13_429885 163 5 SNVs SNVs NNPS 10_1101-2021_02_13_429885 163 6 and and CC 10_1101-2021_02_13_429885 163 7 more more JJR 10_1101-2021_02_13_429885 163 8 general general JJ 10_1101-2021_02_13_429885 163 9 types type NNS 10_1101-2021_02_13_429885 163 10 of of IN 10_1101-2021_02_13_429885 163 11 nucleotide nucleotide JJ 10_1101-2021_02_13_429885 163 12 substitutions substitution NNS 10_1101-2021_02_13_429885 163 13 ; ; : 10_1101-2021_02_13_429885 163 14 SNVs SNVs NNPS 10_1101-2021_02_13_429885 163 15 are be VBP 10_1101-2021_02_13_429885 163 16 more more RBR 10_1101-2021_02_13_429885 163 17 reliable reliable JJ 10_1101-2021_02_13_429885 163 18 and and CC 10_1101-2021_02_13_429885 163 19 depend depend VB 10_1101-2021_02_13_429885 163 20 less less RBR 10_1101-2021_02_13_429885 163 21 on on IN 10_1101-2021_02_13_429885 163 22 alignment alignment NN 10_1101-2021_02_13_429885 163 23 quality quality NN 10_1101-2021_02_13_429885 163 24 than than IN 10_1101-2021_02_13_429885 163 25 other other JJ 10_1101-2021_02_13_429885 163 26 mutations mutation NNS 10_1101-2021_02_13_429885 163 27 , , , 10_1101-2021_02_13_429885 163 28 and and CC 10_1101-2021_02_13_429885 163 29 therefore therefore RB 10_1101-2021_02_13_429885 163 30 should should MD 10_1101-2021_02_13_429885 163 31 be be VB 10_1101-2021_02_13_429885 163 32 checked check VBN 10_1101-2021_02_13_429885 163 33 first first RB 10_1101-2021_02_13_429885 163 34 . . . 
10_1101-2021_02_13_429885 164 1 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 164 2 uses use VBZ 10_1101-2021_02_13_429885 164 3 a a DT 10_1101-2021_02_13_429885 164 4 peak peak NN 10_1101-2021_02_13_429885 164 5 - - HYPH 10_1101-2021_02_13_429885 164 6 detection detection NN 10_1101-2021_02_13_429885 164 7 analysis analysis NN 10_1101-2021_02_13_429885 164 8 to to TO 10_1101-2021_02_13_429885 164 9 validate validate VB 10_1101-2021_02_13_429885 164 10 CNA CNA NNP 10_1101-2021_02_13_429885 164 11 segments segment NNS 10_1101-2021_02_13_429885 164 12 and and CC 10_1101-2021_02_13_429885 164 13 purity purity NN 10_1101-2021_02_13_429885 164 14 , , , 10_1101-2021_02_13_429885 164 15 exploiting exploit VBG 10_1101-2021_02_13_429885 164 16 a a DT 10_1101-2021_02_13_429885 164 17 combinatorial combinatorial JJ 10_1101-2021_02_13_429885 164 18 model model NN 10_1101-2021_02_13_429885 164 19 for for IN 10_1101-2021_02_13_429885 164 20 cancer cancer NN 10_1101-2021_02_13_429885 164 21 alleles allele NNS 10_1101-2021_02_13_429885 164 22 . . . 10_1101-2021_02_13_429885 165 1 Within within IN 10_1101-2021_02_13_429885 165 2 the the DT 10_1101-2021_02_13_429885 165 3 same same JJ 10_1101-2021_02_13_429885 165 4 framework framework NN 10_1101-2021_02_13_429885 165 5 , , , 10_1101-2021_02_13_429885 165 6 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 165 7 also also RB 10_1101-2021_02_13_429885 165 8 computes compute VBZ 10_1101-2021_02_13_429885 165 9 CCF ccf NN 10_1101-2021_02_13_429885 165 10 values value NNS 10_1101-2021_02_13_429885 165 11 , , , 10_1101-2021_02_13_429885 165 12 highlighting highlight VBG 10_1101-2021_02_13_429885 165 13 mutations mutation NNS 10_1101-2021_02_13_429885 165 14 for for IN 10_1101-2021_02_13_429885 165 15 which which WDT 10_1101-2021_02_13_429885 165 16 such such JJ 10_1101-2021_02_13_429885 165 17 values value NNS 10_1101-2021_02_13_429885 165 18 are be VBP 10_1101-2021_02_13_429885 165 19 uncertain uncertain JJ 10_1101-2021_02_13_429885 165 20 . . . 
10_1101-2021_02_13_429885 166 1 CNAqc cnaqc NN 10_1101-2021_02_13_429885 166 2 features feature NNS 10_1101-2021_02_13_429885 166 3 can can MD 10_1101-2021_02_13_429885 166 4 be be VB 10_1101-2021_02_13_429885 166 5 used use VBN 10_1101-2021_02_13_429885 166 6 to to TO 10_1101-2021_02_13_429885 166 7 clean clean VB 10_1101-2021_02_13_429885 166 8 up up RP 10_1101-2021_02_13_429885 166 9 data datum NNS 10_1101-2021_02_13_429885 166 10 , , , 10_1101-2021_02_13_429885 166 11 automatising automatise VBG 10_1101-2021_02_13_429885 166 12 parameter parameter NN 10_1101-2021_02_13_429885 166 13 choice choice NN 10_1101-2021_02_13_429885 166 14 for for IN 10_1101-2021_02_13_429885 166 15 virtually virtually RB 10_1101-2021_02_13_429885 166 16 any any DT 10_1101-2021_02_13_429885 166 17 caller caller NN 10_1101-2021_02_13_429885 166 18 , , , 10_1101-2021_02_13_429885 166 19 prioritizing prioritize VBG 10_1101-2021_02_13_429885 166 20 good good JJ 10_1101-2021_02_13_429885 166 21 calls call NNS 10_1101-2021_02_13_429885 166 22 and and CC 10_1101-2021_02_13_429885 166 23 selecting select VBG 10_1101-2021_02_13_429885 166 24 information information NN 10_1101-2021_02_13_429885 166 25 for for IN 10_1101-2021_02_13_429885 166 26 downstream downstream JJ 10_1101-2021_02_13_429885 166 27 analyses analysis NNS 10_1101-2021_02_13_429885 166 28 . . . 10_1101-2021_02_13_429885 167 1 The the DT 10_1101-2021_02_13_429885 167 2 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 167 3 framework framework NN 10_1101-2021_02_13_429885 167 4 leverages leverage VBZ 10_1101-2021_02_13_429885 167 5 the the DT 10_1101-2021_02_13_429885 167 6 relationship relationship NN 10_1101-2021_02_13_429885 167 7 between between IN 10_1101-2021_02_13_429885 167 8 tumour tumour NN 10_1101-2021_02_13_429885 167 9 VAF VAF NNP 10_1101-2021_02_13_429885 167 10 and and CC 10_1101-2021_02_13_429885 167 11 ploidy ploidy NN 10_1101-2021_02_13_429885 167 12 . . . 
10_1101-2021_02_13_429885 168 1 The the DT 10_1101-2021_02_13_429885 168 2 quality quality NN 10_1101-2021_02_13_429885 168 3 of of IN 10_1101-2021_02_13_429885 168 4 the the DT 10_1101-2021_02_13_429885 168 5 control control NN 10_1101-2021_02_13_429885 168 6 process process NN 10_1101-2021_02_13_429885 168 7 itself -PRON- PRP 10_1101-2021_02_13_429885 168 8 depends depend VBZ 10_1101-2021_02_13_429885 168 9 on on IN 10_1101-2021_02_13_429885 168 10 the the DT 10_1101-2021_02_13_429885 168 11 ability ability NN 10_1101-2021_02_13_429885 168 12 to to TO 10_1101-2021_02_13_429885 168 13 process process VB 10_1101-2021_02_13_429885 168 14 the the DT 10_1101-2021_02_13_429885 168 15 VAF VAF NNP 10_1101-2021_02_13_429885 168 16 spectrum spectrum NN 10_1101-2021_02_13_429885 168 17 and and CC 10_1101-2021_02_13_429885 168 18 detect detect VB 10_1101-2021_02_13_429885 168 19 peaks peak NNS 10_1101-2021_02_13_429885 168 20 . . . 10_1101-2021_02_13_429885 169 1 Therefore therefore RB 10_1101-2021_02_13_429885 169 2 , , , 10_1101-2021_02_13_429885 169 3 if if IN 10_1101-2021_02_13_429885 169 4 the the DT 10_1101-2021_02_13_429885 169 5 VAF VAF NNP 10_1101-2021_02_13_429885 169 6 quality quality NN 10_1101-2021_02_13_429885 169 7 is be VBZ 10_1101-2021_02_13_429885 169 8 very very RB 10_1101-2021_02_13_429885 169 9 low low JJ 10_1101-2021_02_13_429885 169 10 because because IN 10_1101-2021_02_13_429885 169 11 , , , 10_1101-2021_02_13_429885 169 12 e.g. e.g. 
RB 10_1101-2021_02_13_429885 169 13 , , , 10_1101-2021_02_13_429885 169 14 the the DT 10_1101-2021_02_13_429885 169 15 sample sample NN 10_1101-2021_02_13_429885 169 16 has have VBZ 10_1101-2021_02_13_429885 169 17 low low JJ 10_1101-2021_02_13_429885 169 18 purity purity NN 10_1101-2021_02_13_429885 169 19 or or CC 10_1101-2021_02_13_429885 169 20 coverage coverage NN 10_1101-2021_02_13_429885 169 21 , , , 10_1101-2021_02_13_429885 169 22 the the DT 10_1101-2021_02_13_429885 169 23 overall overall JJ 10_1101-2021_02_13_429885 169 24 quality quality NN 10_1101-2021_02_13_429885 169 25 of of IN 10_1101-2021_02_13_429885 169 26 the the DT 10_1101-2021_02_13_429885 169 27 check check NN 10_1101-2021_02_13_429885 169 28 decreases decrease VBZ 10_1101-2021_02_13_429885 169 29 , , , 10_1101-2021_02_13_429885 169 30 making make VBG 10_1101-2021_02_13_429885 169 31 it -PRON- PRP 10_1101-2021_02_13_429885 169 32 more more RBR 10_1101-2021_02_13_429885 169 33 difficult difficult JJ 10_1101-2021_02_13_429885 169 34 to to TO 10_1101-2021_02_13_429885 169 35 completely completely RB 10_1101-2021_02_13_429885 169 36 automate automate VB 10_1101-2021_02_13_429885 169 37 quality quality NN 10_1101-2021_02_13_429885 169 38 checking checking NN 10_1101-2021_02_13_429885 169 39 . . . 
10_1101-2021_02_13_429885 170 1 However however RB 10_1101-2021_02_13_429885 170 2 , , , 10_1101-2021_02_13_429885 170 3 for for IN 10_1101-2021_02_13_429885 170 4 the the DT 10_1101-2021_02_13_429885 170 5 large large JJ 10_1101-2021_02_13_429885 170 6 majority majority NN 10_1101-2021_02_13_429885 170 7 of of IN 10_1101-2021_02_13_429885 170 8 samples sample NNS 10_1101-2021_02_13_429885 170 9 , , , 10_1101-2021_02_13_429885 170 10 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 170 11 provides provide VBZ 10_1101-2021_02_13_429885 170 12 a a DT 10_1101-2021_02_13_429885 170 13 very very RB 10_1101-2021_02_13_429885 170 14 effective effective JJ 10_1101-2021_02_13_429885 170 15 and and CC 10_1101-2021_02_13_429885 170 16 fast fast JJ 10_1101-2021_02_13_429885 170 17 way way NN 10_1101-2021_02_13_429885 170 18 to to TO 10_1101-2021_02_13_429885 170 19 integrate integrate VB 10_1101-2021_02_13_429885 170 20 quality quality NN 10_1101-2021_02_13_429885 170 21 metrics metric NNS 10_1101-2021_02_13_429885 170 22 in in IN 10_1101-2021_02_13_429885 170 23 standard standard JJ 10_1101-2021_02_13_429885 170 24 pipelines pipeline NNS 10_1101-2021_02_13_429885 170 25 . . . 
10_1101-2021_02_13_429885 171 1 Generating generate VBG 10_1101-2021_02_13_429885 171 2 high high JJ 10_1101-2021_02_13_429885 171 3 quality quality NN 10_1101-2021_02_13_429885 171 4 calls call NNS 10_1101-2021_02_13_429885 171 5 is be VBZ 10_1101-2021_02_13_429885 171 6 just just RB 10_1101-2021_02_13_429885 171 7 a a DT 10_1101-2021_02_13_429885 171 8 prelude prelude NN 10_1101-2021_02_13_429885 171 9 to to IN 10_1101-2021_02_13_429885 171 10 more more RBR 10_1101-2021_02_13_429885 171 11 complex complex JJ 10_1101-2021_02_13_429885 171 12 analyses analysis NNS 10_1101-2021_02_13_429885 171 13 that that WDT 10_1101-2021_02_13_429885 171 14 interpret interpret VBP 10_1101-2021_02_13_429885 171 15 cancer cancer NN 10_1101-2021_02_13_429885 171 16 genotypes genotype NNS 10_1101-2021_02_13_429885 171 17 and and CC 10_1101-2021_02_13_429885 171 18 their -PRON- PRP$ 10_1101-2021_02_13_429885 171 19 history history NN 10_1101-2021_02_13_429885 171 20 , , , 10_1101-2021_02_13_429885 171 21 with with IN 10_1101-2021_02_13_429885 171 22 and and CC 10_1101-2021_02_13_429885 171 23 without without IN 10_1101-2021_02_13_429885 171 24 therapy therapy NN 10_1101-2021_02_13_429885 171 25 ​(Ding ​(Ding NNP 10_1101-2021_02_13_429885 171 26 et et NNP 10_1101-2021_02_13_429885 171 27 al al NNP 10_1101-2021_02_13_429885 171 28 . . . 10_1101-2021_02_13_429885 172 1 2012 2012 CD 10_1101-2021_02_13_429885 172 2 ; ; : 10_1101-2021_02_13_429885 172 3 Landau Landau NNP 10_1101-2021_02_13_429885 172 4 et et NNP 10_1101-2021_02_13_429885 172 5 al al NNP 10_1101-2021_02_13_429885 172 6 . . . 10_1101-2021_02_13_429885 173 1 2013 2013 CD 10_1101-2021_02_13_429885 173 2 ; ; : 10_1101-2021_02_13_429885 173 3 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 173 4 et et NNP 10_1101-2021_02_13_429885 173 5 al al NNP 10_1101-2021_02_13_429885 173 6 . . . 
10_1101-2021_02_13_429885 174 1 07 07 CD 10_1101-2021_02_13_429885 174 2 12 12 CD 10_1101-2021_02_13_429885 174 3 , , , 10_1101-2021_02_13_429885 174 4 2016 2016 CD 10_1101-2021_02_13_429885 174 5 ; ; : 10_1101-2021_02_13_429885 174 6 Jamal Jamal NNP 10_1101-2021_02_13_429885 174 7 - - HYPH 10_1101-2021_02_13_429885 174 8 Hanjani Hanjani NNP 10_1101-2021_02_13_429885 174 9 et et NNP 10_1101-2021_02_13_429885 174 10 al al NNP 10_1101-2021_02_13_429885 174 11 . . . 10_1101-2021_02_13_429885 175 1 2017 2017 CD 10_1101-2021_02_13_429885 175 2 ; ; : 10_1101-2021_02_13_429885 175 3 Turajlic Turajlic NNP 10_1101-2021_02_13_429885 175 4 et et FW 10_1101-2021_02_13_429885 175 5 al al NNP 10_1101-2021_02_13_429885 175 6 . . . 10_1101-2021_02_13_429885 176 1 2018 2018 CD 10_1101-2021_02_13_429885 176 2 ; ; : 10_1101-2021_02_13_429885 176 3 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 176 4 et et NNP 10_1101-2021_02_13_429885 176 5 al al NNP 10_1101-2021_02_13_429885 176 6 . . . 10_1101-2021_02_13_429885 177 1 09 09 CD 10_1101-2021_02_13_429885 177 2 2018)​. 2018)​. 
CD 10_1101-2021_02_13_429885 178 1 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 178 2 can can MD 10_1101-2021_02_13_429885 178 3 pass pass VB 10_1101-2021_02_13_429885 178 4 a a DT 10_1101-2021_02_13_429885 178 5 sample sample NN 10_1101-2021_02_13_429885 178 6 at at IN 10_1101-2021_02_13_429885 178 7 an an DT 10_1101-2021_02_13_429885 178 8 early early JJ 10_1101-2021_02_13_429885 178 9 stage stage NN 10_1101-2021_02_13_429885 178 10 , , , 10_1101-2021_02_13_429885 178 11 leaving leave VBG 10_1101-2021_02_13_429885 178 12 the the DT 10_1101-2021_02_13_429885 178 13 possibility possibility NN 10_1101-2021_02_13_429885 178 14 of of IN 10_1101-2021_02_13_429885 178 15 assessing assessing NN 10_1101-2021_02_13_429885 178 16 , , , 10_1101-2021_02_13_429885 178 17 at at IN 10_1101-2021_02_13_429885 178 18 a a DT 10_1101-2021_02_13_429885 178 19 later later JJ 10_1101-2021_02_13_429885 178 20 stage stage NN 10_1101-2021_02_13_429885 178 21 , , , 10_1101-2021_02_13_429885 178 22 whether whether IN 10_1101-2021_02_13_429885 178 23 the the DT 10_1101-2021_02_13_429885 178 24 quality quality NN 10_1101-2021_02_13_429885 178 25 of of IN 10_1101-2021_02_13_429885 178 26 the the DT 10_1101-2021_02_13_429885 178 27 data datum NNS 10_1101-2021_02_13_429885 178 28 is be VBZ 10_1101-2021_02_13_429885 178 29 high high JJ 10_1101-2021_02_13_429885 178 30 enough enough RB 10_1101-2021_02_13_429885 178 31 to to TO 10_1101-2021_02_13_429885 178 32 approach approach VB 10_1101-2021_02_13_429885 178 33 specific specific JJ 10_1101-2021_02_13_429885 178 34 research research NN 10_1101-2021_02_13_429885 178 35 questions question NNS 10_1101-2021_02_13_429885 178 36 . . . 
10_1101-2021_02_13_429885 179 1 With with IN 10_1101-2021_02_13_429885 179 2 the the DT 10_1101-2021_02_13_429885 179 3 ongoing ongoing JJ 10_1101-2021_02_13_429885 179 4 implementation implementation NN 10_1101-2021_02_13_429885 179 5 of of IN 10_1101-2021_02_13_429885 179 6 large large JJ 10_1101-2021_02_13_429885 179 7 - - HYPH 10_1101-2021_02_13_429885 179 8 scale scale NN 10_1101-2021_02_13_429885 179 9 sequencing sequencing NN 10_1101-2021_02_13_429885 179 10 efforts effort NNS 10_1101-2021_02_13_429885 179 11 , , , 10_1101-2021_02_13_429885 179 12 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 179 13 provides provide VBZ 10_1101-2021_02_13_429885 179 14 a a DT 10_1101-2021_02_13_429885 179 15 good good JJ 10_1101-2021_02_13_429885 179 16 solution solution NN 10_1101-2021_02_13_429885 179 17 for for IN 10_1101-2021_02_13_429885 179 18 modular modular JJ 10_1101-2021_02_13_429885 179 19 pipelines pipeline NNS 10_1101-2021_02_13_429885 179 20 that that IN 10_1101-2021_02_13_429885 179 21 self self NN 10_1101-2021_02_13_429885 179 22 - - HYPH 10_1101-2021_02_13_429885 179 23 tune tune NN 10_1101-2021_02_13_429885 179 24 parameters parameter NNS 10_1101-2021_02_13_429885 179 25 , , , 10_1101-2021_02_13_429885 179 26 based base VBN 10_1101-2021_02_13_429885 179 27 on on IN 10_1101-2021_02_13_429885 179 28 quality quality NN 10_1101-2021_02_13_429885 179 29 scores score NNS 10_1101-2021_02_13_429885 179 30 . . . 
10_1101-2021_02_13_429885 180 1 To to IN 10_1101-2021_02_13_429885 180 2 our -PRON- PRP$ 10_1101-2021_02_13_429885 180 3 knowledge knowledge NN 10_1101-2021_02_13_429885 180 4 , , , 10_1101-2021_02_13_429885 180 5 this this DT 10_1101-2021_02_13_429885 180 6 is be VBZ 10_1101-2021_02_13_429885 180 7 the the DT 10_1101-2021_02_13_429885 180 8 first first JJ 10_1101-2021_02_13_429885 180 9 stand stand VB 10_1101-2021_02_13_429885 180 10 - - HYPH 10_1101-2021_02_13_429885 180 11 alone alone RB 10_1101-2021_02_13_429885 180 12 tool tool NN 10_1101-2021_02_13_429885 180 13 which which WDT 10_1101-2021_02_13_429885 180 14 leverages leverage VBZ 10_1101-2021_02_13_429885 180 15 the the DT 10_1101-2021_02_13_429885 180 16 power power NN 10_1101-2021_02_13_429885 180 17 of of IN 10_1101-2021_02_13_429885 180 18 combining combine VBG 10_1101-2021_02_13_429885 180 19 the the DT 10_1101-2021_02_13_429885 180 20 most most RBS 10_1101-2021_02_13_429885 180 21 common common JJ 10_1101-2021_02_13_429885 180 22 types type NNS 10_1101-2021_02_13_429885 180 23 of of IN 10_1101-2021_02_13_429885 180 24 cancer cancer NN 10_1101-2021_02_13_429885 180 25 mutations mutation NNS 10_1101-2021_02_13_429885 180 26 - - , 10_1101-2021_02_13_429885 180 27 SNVs snv NNS 10_1101-2021_02_13_429885 180 28 and and CC 10_1101-2021_02_13_429885 180 29 CNAs cna NNS 10_1101-2021_02_13_429885 180 30 - - , 10_1101-2021_02_13_429885 180 31 to to TO 10_1101-2021_02_13_429885 180 32 automatically automatically RB 10_1101-2021_02_13_429885 180 33 control control VB 10_1101-2021_02_13_429885 180 34 the the DT 10_1101-2021_02_13_429885 180 35 quality quality NN 10_1101-2021_02_13_429885 180 36 of of IN 10_1101-2021_02_13_429885 180 37 WGS WGS NNP 10_1101-2021_02_13_429885 180 38 assays assays RB 10_1101-2021_02_13_429885 180 39 . . . 
10_1101-2021_02_13_429885 181 1 We -PRON- PRP 10_1101-2021_02_13_429885 181 2 believe believe VBP 10_1101-2021_02_13_429885 181 3 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 181 4 can can MD 10_1101-2021_02_13_429885 181 5 help help VB 10_1101-2021_02_13_429885 181 6 reduce reduce VB 10_1101-2021_02_13_429885 181 7 the the DT 10_1101-2021_02_13_429885 181 8 burden burden NN 10_1101-2021_02_13_429885 181 9 of of IN 10_1101-2021_02_13_429885 181 10 manual manual JJ 10_1101-2021_02_13_429885 181 11 quality quality NN 10_1101-2021_02_13_429885 181 12 checking checking NN 10_1101-2021_02_13_429885 181 13 and and CC 10_1101-2021_02_13_429885 181 14 parameter parameter NN 10_1101-2021_02_13_429885 181 15 tuning tuning NN 10_1101-2021_02_13_429885 181 16 . . . 10_1101-2021_02_13_429885 182 1 References References NNPS 10_1101-2021_02_13_429885 182 2 Bailey Bailey NNP 10_1101-2021_02_13_429885 182 3 , , , 10_1101-2021_02_13_429885 182 4 Matthew Matthew NNP 10_1101-2021_02_13_429885 182 5 H. H. NNP 10_1101-2021_02_13_429885 182 6 , , , 10_1101-2021_02_13_429885 182 7 Collin Collin NNP 10_1101-2021_02_13_429885 182 8 Tokheim Tokheim NNP 10_1101-2021_02_13_429885 182 9 , , , 10_1101-2021_02_13_429885 182 10 Eduard Eduard NNP 10_1101-2021_02_13_429885 182 11 Porta Porta NNP 10_1101-2021_02_13_429885 182 12 - - HYPH 10_1101-2021_02_13_429885 182 13 Pardo Pardo NNP 10_1101-2021_02_13_429885 182 14 , , , 10_1101-2021_02_13_429885 182 15 Sohini Sohini NNP 10_1101-2021_02_13_429885 182 16 Sengupta Sengupta NNP 10_1101-2021_02_13_429885 182 17 , , , 10_1101-2021_02_13_429885 182 18 Denis Denis NNP 10_1101-2021_02_13_429885 182 19 Bertrand Bertrand NNP 10_1101-2021_02_13_429885 182 20 , , , 10_1101-2021_02_13_429885 182 21 Amila Amila NNP 10_1101-2021_02_13_429885 182 22 Weerasinghe Weerasinghe NNP 10_1101-2021_02_13_429885 182 23 , , , 10_1101-2021_02_13_429885 182 24 Antonio Antonio NNP 10_1101-2021_02_13_429885 182 25 Colaprico Colaprico NNP 10_1101-2021_02_13_429885 182 26 , , , 
10_1101-2021_02_13_429885 182 27 et et NNP 10_1101-2021_02_13_429885 182 28 al al NNP 10_1101-2021_02_13_429885 182 29 . . . 10_1101-2021_02_13_429885 183 1 2018 2018 CD 10_1101-2021_02_13_429885 183 2 . . . 10_1101-2021_02_13_429885 184 1 “ " `` 10_1101-2021_02_13_429885 184 2 Comprehensive comprehensive JJ 10_1101-2021_02_13_429885 184 3 Characterization characterization NN 10_1101-2021_02_13_429885 184 4 of of IN 10_1101-2021_02_13_429885 184 5 Cancer Cancer NNP 10_1101-2021_02_13_429885 184 6 Driver Driver NNP 10_1101-2021_02_13_429885 184 7 Genes Genes NNP 10_1101-2021_02_13_429885 184 8 and and CC 10_1101-2021_02_13_429885 184 9 Mutations Mutations NNPS 10_1101-2021_02_13_429885 184 10 . . . 10_1101-2021_02_13_429885 184 11 ” " '' 10_1101-2021_02_13_429885 184 12 ​Cell​ ​Cell​ NNP 10_1101-2021_02_13_429885 184 13 173 173 CD 10_1101-2021_02_13_429885 184 14 ( ( -LRB- 10_1101-2021_02_13_429885 184 15 2 2 CD 10_1101-2021_02_13_429885 184 16 ) ) -RRB- 10_1101-2021_02_13_429885 184 17 : : : 10_1101-2021_02_13_429885 184 18 371–85.e18 371–85.e18 LS 10_1101-2021_02_13_429885 184 19 . . . 10_1101-2021_02_13_429885 185 1 https://doi.org/​10.1016/j.cell.2018.02.060 https://doi.org/​10.1016/j.cell.2018.02.060 NNP 10_1101-2021_02_13_429885 185 2 ​. ​. NNP 10_1101-2021_02_13_429885 186 1 Barnell Barnell NNP 10_1101-2021_02_13_429885 186 2 , , , 10_1101-2021_02_13_429885 186 3 Erica Erica NNP 10_1101-2021_02_13_429885 186 4 K. K. NNP 10_1101-2021_02_13_429885 186 5 , , , 10_1101-2021_02_13_429885 186 6 Peter Peter NNP 10_1101-2021_02_13_429885 186 7 Ronning Ronning NNP 10_1101-2021_02_13_429885 186 8 , , , 10_1101-2021_02_13_429885 186 9 Katie Katie NNP 10_1101-2021_02_13_429885 186 10 M. M. 
NNP 10_1101-2021_02_13_429885 186 11 Campbell Campbell NNP 10_1101-2021_02_13_429885 186 12 , , , 10_1101-2021_02_13_429885 186 13 Kilannin Kilannin NNP 10_1101-2021_02_13_429885 186 14 Krysiak Krysiak NNP 10_1101-2021_02_13_429885 186 15 , , , 10_1101-2021_02_13_429885 186 16 Benjamin Benjamin NNP 10_1101-2021_02_13_429885 186 17 J. J. NNP 10_1101-2021_02_13_429885 186 18 Ainscough Ainscough NNP 10_1101-2021_02_13_429885 186 19 , , , 10_1101-2021_02_13_429885 186 20 Lana Lana NNP 10_1101-2021_02_13_429885 186 21 M. M. NNP 10_1101-2021_02_13_429885 186 22 Sheta Sheta NNP 10_1101-2021_02_13_429885 186 23 , , , 10_1101-2021_02_13_429885 186 24 Shahil Shahil NNP 10_1101-2021_02_13_429885 186 25 P. P. NNP 10_1101-2021_02_13_429885 186 26 Pema Pema NNP 10_1101-2021_02_13_429885 186 27 , , , 10_1101-2021_02_13_429885 186 28 et et NNP 10_1101-2021_02_13_429885 186 29 al al NNP 10_1101-2021_02_13_429885 186 30 . . . 10_1101-2021_02_13_429885 187 1 2019 2019 CD 10_1101-2021_02_13_429885 187 2 . . . 10_1101-2021_02_13_429885 188 1 “ " `` 10_1101-2021_02_13_429885 188 2 Standard Standard NNP 10_1101-2021_02_13_429885 188 3 Operating Operating NNP 10_1101-2021_02_13_429885 188 4 Procedure Procedure NNP 10_1101-2021_02_13_429885 188 5 for for IN 10_1101-2021_02_13_429885 188 6 Somatic Somatic NNP 10_1101-2021_02_13_429885 188 7 Variant Variant NNP 10_1101-2021_02_13_429885 188 8 Refinement Refinement NNP 10_1101-2021_02_13_429885 188 9 of of IN 10_1101-2021_02_13_429885 188 10 Sequencing Sequencing NNP 10_1101-2021_02_13_429885 188 11 Data Data NNP 10_1101-2021_02_13_429885 188 12 with with IN 10_1101-2021_02_13_429885 188 13 Paired Paired NNP 10_1101-2021_02_13_429885 188 14 Tumor Tumor NNP 10_1101-2021_02_13_429885 188 15 and and CC 10_1101-2021_02_13_429885 188 16 Normal Normal NNP 10_1101-2021_02_13_429885 188 17 Samples Samples NNPS 10_1101-2021_02_13_429885 188 18 . . . 
10_1101-2021_02_13_429885 188 19 ” " '' 10_1101-2021_02_13_429885 188 20 ​Genetics ​genetic NNS 10_1101-2021_02_13_429885 191 1 in in IN 10_1101-2021_02_13_429885 191 2 Medicine Medicine NNP 10_1101-2021_02_13_429885 191 3 : : : 10_1101-2021_02_13_429885 191 4 Official Official NNP 10_1101-2021_02_13_429885 191 5 Journal Journal NNP 10_1101-2021_02_13_429885 191 6 of of IN 10_1101-2021_02_13_429885 191 7 the the DT 10_1101-2021_02_13_429885 191 8 American American NNP 10_1101-2021_02_13_429885 191 9 College College NNP 10_1101-2021_02_13_429885 191 10 of of IN 10_1101-2021_02_13_429885 191 11 Medical Medical NNP 10_1101-2021_02_13_429885 191 12 Genetics​ Genetics​ NNP 10_1101-2021_02_13_429885 191 13 21 21 CD 10_1101-2021_02_13_429885 191 14 ( ( -LRB- 10_1101-2021_02_13_429885 191 15 4 4 CD 10_1101-2021_02_13_429885 191 16 ) ) -RRB- 10_1101-2021_02_13_429885 191 17 : : : 10_1101-2021_02_13_429885 191 18 972–81 972–81 CD 10_1101-2021_02_13_429885 191 19 . . . 10_1101-2021_02_13_429885 192 1 https://doi.org/​10.1038/s41436-018-0278-z​. https://doi.org/​10.1038/s41436-018-0278-z​. 
LS 10_1101-2021_02_13_429885 193 1 Benjamin Benjamin NNP 10_1101-2021_02_13_429885 193 2 , , , 10_1101-2021_02_13_429885 193 3 David David NNP 10_1101-2021_02_13_429885 193 4 , , , 10_1101-2021_02_13_429885 193 5 Takuto Takuto NNP 10_1101-2021_02_13_429885 193 6 Sato Sato NNP 10_1101-2021_02_13_429885 193 7 , , , 10_1101-2021_02_13_429885 193 8 Kristian kristian JJ 10_1101-2021_02_13_429885 193 9 Cibulskis Cibulskis NNP 10_1101-2021_02_13_429885 193 10 , , , 10_1101-2021_02_13_429885 193 11 Gad Gad NNP 10_1101-2021_02_13_429885 193 12 Getz Getz NNP 10_1101-2021_02_13_429885 193 13 , , , 10_1101-2021_02_13_429885 193 14 Chip Chip NNP 10_1101-2021_02_13_429885 193 15 Stewart Stewart NNP 10_1101-2021_02_13_429885 193 16 , , , 10_1101-2021_02_13_429885 193 17 and and CC 10_1101-2021_02_13_429885 193 18 Lee Lee NNP 10_1101-2021_02_13_429885 193 19 Lichtenstein Lichtenstein NNP 10_1101-2021_02_13_429885 193 20 . . . 10_1101-2021_02_13_429885 194 1 2019 2019 CD 10_1101-2021_02_13_429885 194 2 . . . 10_1101-2021_02_13_429885 195 1 “ " `` 10_1101-2021_02_13_429885 195 2 Calling call VBG 10_1101-2021_02_13_429885 195 3 Somatic somatic JJ 10_1101-2021_02_13_429885 195 4 SNVs SNVs NNPS 10_1101-2021_02_13_429885 195 5 and and CC 10_1101-2021_02_13_429885 195 6 Indels Indels NNPS 10_1101-2021_02_13_429885 195 7 with with IN 10_1101-2021_02_13_429885 195 8 Mutect2 Mutect2 NNP 10_1101-2021_02_13_429885 195 9 . . . 10_1101-2021_02_13_429885 195 10 ” " '' 10_1101-2021_02_13_429885 195 11 ​bioRxiv​ ​bioRxiv​ NNP 10_1101-2021_02_13_429885 195 12 , , , 10_1101-2021_02_13_429885 195 13 December December NNP 10_1101-2021_02_13_429885 195 14 , , , 10_1101-2021_02_13_429885 195 15 861054 861054 CD 10_1101-2021_02_13_429885 195 16 . . . 10_1101-2021_02_13_429885 195 17 https://doi.org/​10.1101/861054 https://doi.org/​10.1101/861054 NNP 10_1101-2021_02_13_429885 195 18 ​. ​. 
NNP 10_1101-2021_02_13_429885 196 1 Boeva Boeva NNP 10_1101-2021_02_13_429885 196 2 , , , 10_1101-2021_02_13_429885 196 3 Valentina Valentina NNP 10_1101-2021_02_13_429885 196 4 , , , 10_1101-2021_02_13_429885 196 5 Andrei Andrei NNP 10_1101-2021_02_13_429885 196 6 Zinovyev Zinovyev NNP 10_1101-2021_02_13_429885 196 7 , , , 10_1101-2021_02_13_429885 196 8 Kevin Kevin NNP 10_1101-2021_02_13_429885 196 9 Bleakley Bleakley NNP 10_1101-2021_02_13_429885 196 10 , , , 10_1101-2021_02_13_429885 196 11 Jean Jean NNP 10_1101-2021_02_13_429885 196 12 - - HYPH 10_1101-2021_02_13_429885 196 13 Philippe Philippe NNP 10_1101-2021_02_13_429885 196 14 Vert Vert NNP 10_1101-2021_02_13_429885 196 15 , , , 10_1101-2021_02_13_429885 196 16 Isabelle Isabelle NNP 10_1101-2021_02_13_429885 196 17 Janoueix Janoueix NNP 10_1101-2021_02_13_429885 196 18 - - HYPH 10_1101-2021_02_13_429885 196 19 Lerosey Lerosey NNP 10_1101-2021_02_13_429885 196 20 , , , 10_1101-2021_02_13_429885 196 21 Olivier Olivier NNP 10_1101-2021_02_13_429885 196 22 Delattre Delattre NNP 10_1101-2021_02_13_429885 196 23 , , , 10_1101-2021_02_13_429885 196 24 and and CC 10_1101-2021_02_13_429885 196 25 Emmanuel Emmanuel NNP 10_1101-2021_02_13_429885 196 26 Barillot Barillot NNP 10_1101-2021_02_13_429885 196 27 . . . 10_1101-2021_02_13_429885 197 1 2011 2011 CD 10_1101-2021_02_13_429885 197 2 . . . 
10_1101-2021_02_13_429885 198 1 “ " `` 10_1101-2021_02_13_429885 198 2 Control Control NNP 10_1101-2021_02_13_429885 198 3 - - HYPH 10_1101-2021_02_13_429885 198 4 Free Free NNP 10_1101-2021_02_13_429885 198 5 Calling Calling NNP 10_1101-2021_02_13_429885 198 6 of of IN 10_1101-2021_02_13_429885 198 7 Copy Copy NNP 10_1101-2021_02_13_429885 198 8 Number Number NNP 10_1101-2021_02_13_429885 198 9 Alterations Alterations NNPS 10_1101-2021_02_13_429885 198 10 in in IN 10_1101-2021_02_13_429885 198 11 Deep Deep NNP 10_1101-2021_02_13_429885 198 12 - - HYPH 10_1101-2021_02_13_429885 198 13 Sequencing Sequencing NNP 10_1101-2021_02_13_429885 198 14 Data datum NNS 10_1101-2021_02_13_429885 198 15 Using use VBG 10_1101-2021_02_13_429885 198 16 GC GC NNP 10_1101-2021_02_13_429885 198 17 - - HYPH 10_1101-2021_02_13_429885 198 18 Content Content NNP 10_1101-2021_02_13_429885 198 19 Normalization Normalization NNP 10_1101-2021_02_13_429885 198 20 . . . 10_1101-2021_02_13_429885 198 21 ” " '' 10_1101-2021_02_13_429885 198 22 Bioinformatics Bioinformatics NNP 10_1101-2021_02_13_429885 198 23 ​ ​ NNP 10_1101-2021_02_13_429885 198 24 27 27 CD 10_1101-2021_02_13_429885 198 25 ( ( -LRB- 10_1101-2021_02_13_429885 198 26 2 2 CD 10_1101-2021_02_13_429885 198 27 ) ) -RRB- 10_1101-2021_02_13_429885 198 28 : : : 10_1101-2021_02_13_429885 198 29 268–69 268–69 CD 10_1101-2021_02_13_429885 198 30 . . . 10_1101-2021_02_13_429885 199 1 https://doi.org/​10.1093/bioinformatics/btq635 https://doi.org/​10.1093/bioinformatics/btq635 PRP 10_1101-2021_02_13_429885 199 2 ​. ​. NNP 10_1101-2021_02_13_429885 200 1 Campbell Campbell NNP 10_1101-2021_02_13_429885 200 2 , , , 10_1101-2021_02_13_429885 200 3 Peter Peter NNP 10_1101-2021_02_13_429885 200 4 J. J. NNP 10_1101-2021_02_13_429885 200 5 , , , 10_1101-2021_02_13_429885 200 6 Gad Gad NNP 10_1101-2021_02_13_429885 200 7 Getz Getz NNP 10_1101-2021_02_13_429885 200 8 , , , 10_1101-2021_02_13_429885 200 9 Jan Jan NNP 10_1101-2021_02_13_429885 200 10 O. 
O. NNP 10_1101-2021_02_13_429885 200 11 Korbel Korbel NNP 10_1101-2021_02_13_429885 200 12 , , , 10_1101-2021_02_13_429885 200 13 Joshua Joshua NNP 10_1101-2021_02_13_429885 200 14 M. M. NNP 10_1101-2021_02_13_429885 200 15 Stuart Stuart NNP 10_1101-2021_02_13_429885 200 16 , , , 10_1101-2021_02_13_429885 200 17 Jennifer Jennifer NNP 10_1101-2021_02_13_429885 200 18 L. L. NNP 10_1101-2021_02_13_429885 200 19 Jennings Jennings NNP 10_1101-2021_02_13_429885 200 20 , , , 10_1101-2021_02_13_429885 200 21 Lincoln Lincoln NNP 10_1101-2021_02_13_429885 200 22 D. D. NNP 10_1101-2021_02_13_429885 200 23 Stein Stein NNP 10_1101-2021_02_13_429885 200 24 , , , 10_1101-2021_02_13_429885 200 25 Marc Marc NNP 10_1101-2021_02_13_429885 200 26 D. D. NNP 10_1101-2021_02_13_429885 200 27 Perry Perry NNP 10_1101-2021_02_13_429885 200 28 , , , 10_1101-2021_02_13_429885 200 29 et et NNP 10_1101-2021_02_13_429885 200 30 al al NNP 10_1101-2021_02_13_429885 200 31 . . . 10_1101-2021_02_13_429885 201 1 2020 2020 CD 10_1101-2021_02_13_429885 201 2 . . . 10_1101-2021_02_13_429885 202 1 “ " `` 10_1101-2021_02_13_429885 202 2 Pan Pan NNP 10_1101-2021_02_13_429885 202 3 - - NNP 10_1101-2021_02_13_429885 202 4 Cancer Cancer NNP 10_1101-2021_02_13_429885 202 5 Analysis Analysis NNP 10_1101-2021_02_13_429885 202 6 of of IN 10_1101-2021_02_13_429885 202 7 Whole Whole NNP 10_1101-2021_02_13_429885 202 8 Genomes Genomes NNP 10_1101-2021_02_13_429885 202 9 . . . 10_1101-2021_02_13_429885 202 10 ” " '' 10_1101-2021_02_13_429885 202 11 ​Nature​ ​Nature​ NNP 10_1101-2021_02_13_429885 202 12 578 578 CD 10_1101-2021_02_13_429885 202 13 ( ( -LRB- 10_1101-2021_02_13_429885 202 14 7793 7793 CD 10_1101-2021_02_13_429885 202 15 ) ) -RRB- 10_1101-2021_02_13_429885 202 16 : : : 10_1101-2021_02_13_429885 202 17 82–93 82–93 CD 10_1101-2021_02_13_429885 202 18 . . . 
10_1101-2021_02_13_429885 203 1 https://doi.org/​10.1038/s41586-020-1969-6 https://doi.org/​10.1038/s41586-020-1969-6 ADD 10_1101-2021_02_13_429885 203 2 ​. ​. CD 10_1101-2021_02_13_429885 204 1 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 204 2 , , , 10_1101-2021_02_13_429885 204 3 Giulio Giulio NNP 10_1101-2021_02_13_429885 204 4 , , , 10_1101-2021_02_13_429885 204 5 Ylenia Ylenia NNP 10_1101-2021_02_13_429885 204 6 Giarratano Giarratano NNP 10_1101-2021_02_13_429885 204 7 , , , 10_1101-2021_02_13_429885 204 8 Daniele Daniele NNP 10_1101-2021_02_13_429885 204 9 Ramazzotti Ramazzotti NNP 10_1101-2021_02_13_429885 204 10 , , , 10_1101-2021_02_13_429885 204 11 Ian Ian NNP 10_1101-2021_02_13_429885 204 12 Tomlinson Tomlinson NNP 10_1101-2021_02_13_429885 204 13 , , , 10_1101-2021_02_13_429885 204 14 Trevor Trevor NNP 10_1101-2021_02_13_429885 204 15 A. a. NN 10_1101-2021_02_13_429885 204 16 Graham Graham NNP 10_1101-2021_02_13_429885 204 17 , , , 10_1101-2021_02_13_429885 204 18 Guido Guido NNP 10_1101-2021_02_13_429885 204 19 Sanguinetti Sanguinetti NNP 10_1101-2021_02_13_429885 204 20 , , , 10_1101-2021_02_13_429885 204 21 and and CC 10_1101-2021_02_13_429885 204 22 Andrea Andrea NNP 10_1101-2021_02_13_429885 204 23 Sottoriva Sottoriva NNP 10_1101-2021_02_13_429885 204 24 . . . 10_1101-2021_02_13_429885 205 1 09 09 CD 10_1101-2021_02_13_429885 205 2 2018 2018 CD 10_1101-2021_02_13_429885 205 3 . . . 
10_1101-2021_02_13_429885 206 1 “ " `` 10_1101-2021_02_13_429885 206 2 Detecting detect VBG 10_1101-2021_02_13_429885 206 3 Repeated repeat VBN 10_1101-2021_02_13_429885 206 4 Cancer Cancer NNP 10_1101-2021_02_13_429885 206 5 Evolution Evolution NNP 10_1101-2021_02_13_429885 206 6 from from IN 10_1101-2021_02_13_429885 206 7 Multi Multi NNP 10_1101-2021_02_13_429885 206 8 - - NNP 10_1101-2021_02_13_429885 206 9 Region Region NNP 10_1101-2021_02_13_429885 206 10 Tumor Tumor NNP 10_1101-2021_02_13_429885 206 11 Sequencing Sequencing NNP 10_1101-2021_02_13_429885 206 12 Data Data NNPS 10_1101-2021_02_13_429885 206 13 . . . 10_1101-2021_02_13_429885 206 14 ” " '' 10_1101-2021_02_13_429885 206 15 ​Nature ​nature JJ 10_1101-2021_02_13_429885 206 16 Methods​ Methods​ NNP 10_1101-2021_02_13_429885 206 17 15 15 CD 10_1101-2021_02_13_429885 206 18 ( ( -LRB- 10_1101-2021_02_13_429885 206 19 9 9 CD 10_1101-2021_02_13_429885 206 20 ) ) -RRB- 10_1101-2021_02_13_429885 206 21 : : : 10_1101-2021_02_13_429885 206 22 707–14 707–14 CD 10_1101-2021_02_13_429885 206 23 . . . 10_1101-2021_02_13_429885 207 1 https://doi.org/​10.1038/s41592-018-0108-x​. https://doi.org/​10.1038/s41592-018-0108-x​. 
ADD 10_1101-2021_02_13_429885 208 1 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 208 2 , , , 10_1101-2021_02_13_429885 208 3 Giulio Giulio NNP 10_1101-2021_02_13_429885 208 4 , , , 10_1101-2021_02_13_429885 208 5 Alex Alex NNP 10_1101-2021_02_13_429885 208 6 Graudenzi Graudenzi NNP 10_1101-2021_02_13_429885 208 7 , , , 10_1101-2021_02_13_429885 208 8 Daniele Daniele NNP 10_1101-2021_02_13_429885 208 9 Ramazzotti Ramazzotti NNP 10_1101-2021_02_13_429885 208 10 , , , 10_1101-2021_02_13_429885 208 11 Rebeca Rebeca NNP 10_1101-2021_02_13_429885 208 12 Sanz Sanz NNP 10_1101-2021_02_13_429885 208 13 - - HYPH 10_1101-2021_02_13_429885 208 14 Pamplona Pamplona NNP 10_1101-2021_02_13_429885 208 15 , , , 10_1101-2021_02_13_429885 208 16 Luca Luca NNP 10_1101-2021_02_13_429885 208 17 De De NNP 10_1101-2021_02_13_429885 208 18 Sano Sano NNP 10_1101-2021_02_13_429885 208 19 , , , 10_1101-2021_02_13_429885 208 20 Giancarlo Giancarlo NNP 10_1101-2021_02_13_429885 208 21 Mauri Mauri NNP 10_1101-2021_02_13_429885 208 22 , , , 10_1101-2021_02_13_429885 208 23 Victor Victor NNP 10_1101-2021_02_13_429885 208 24 Moreno Moreno NNP 10_1101-2021_02_13_429885 208 25 , , , 10_1101-2021_02_13_429885 208 26 Marco Marco NNP 10_1101-2021_02_13_429885 208 27 Antoniotti Antoniotti NNP 10_1101-2021_02_13_429885 208 28 , , , 10_1101-2021_02_13_429885 208 29 and and CC 10_1101-2021_02_13_429885 208 30 Bud Bud NNP 10_1101-2021_02_13_429885 208 31 Mishra Mishra NNP 10_1101-2021_02_13_429885 208 32 . . . 10_1101-2021_02_13_429885 209 1 07 07 CD 10_1101-2021_02_13_429885 209 2 12 12 CD 10_1101-2021_02_13_429885 209 3 , , , 10_1101-2021_02_13_429885 209 4 2016 2016 CD 10_1101-2021_02_13_429885 209 5 . . . 
10_1101-2021_02_13_429885 210 1 “ " `` 10_1101-2021_02_13_429885 210 2 Algorithmic Algorithmic NNP 10_1101-2021_02_13_429885 210 3 Methods method NNS 10_1101-2021_02_13_429885 210 4 to to TO 10_1101-2021_02_13_429885 210 5 Infer infer VB 10_1101-2021_02_13_429885 210 6 the the DT 10_1101-2021_02_13_429885 210 7 Evolutionary Evolutionary NNP 10_1101-2021_02_13_429885 210 8 Trajectories Trajectories NNPS 10_1101-2021_02_13_429885 210 9 in in IN 10_1101-2021_02_13_429885 210 10 Cancer Cancer NNP 10_1101-2021_02_13_429885 210 11 Progression Progression NNP 10_1101-2021_02_13_429885 210 12 . . . 10_1101-2021_02_13_429885 210 13 ” " '' 10_1101-2021_02_13_429885 210 14 Proceedings proceeding NNS 10_1101-2021_02_13_429885 210 15 of of IN 10_1101-2021_02_13_429885 210 16 the the DT 10_1101-2021_02_13_429885 210 17 National National NNP 10_1101-2021_02_13_429885 210 18 Academy Academy NNP 10_1101-2021_02_13_429885 210 19 of of IN 10_1101-2021_02_13_429885 210 20 Sciences Sciences NNPS 10_1101-2021_02_13_429885 210 21 of of IN 10_1101-2021_02_13_429885 210 22 the the DT 10_1101-2021_02_13_429885 210 23 United United NNP 10_1101-2021_02_13_429885 210 24 States States NNP 10_1101-2021_02_13_429885 210 25 of of IN 10_1101-2021_02_13_429885 210 26 America​ America​ NNP 10_1101-2021_02_13_429885 210 27 113 113 CD 10_1101-2021_02_13_429885 210 28 ( ( -LRB- 10_1101-2021_02_13_429885 210 29 28 28 CD 10_1101-2021_02_13_429885 210 30 ) ) -RRB- 10_1101-2021_02_13_429885 210 31 : : : 10_1101-2021_02_13_429885 210 32 E4025–34 e4025–34 ADD 10_1101-2021_02_13_429885 210 33 . . . 10_1101-2021_02_13_429885 211 1 https://doi.org/​10.1073/pnas.1520213113 https://doi.org/​10.1073/pnas.1520213113 RB 10_1101-2021_02_13_429885 211 2 ​. ​. 
NNP 10_1101-2021_02_13_429885 212 1 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 212 2 , , , 10_1101-2021_02_13_429885 212 3 Giulio Giulio NNP 10_1101-2021_02_13_429885 212 4 , , , 10_1101-2021_02_13_429885 212 5 Timon Timon NNP 10_1101-2021_02_13_429885 212 6 Heide Heide NNP 10_1101-2021_02_13_429885 212 7 , , , 10_1101-2021_02_13_429885 212 8 Marc Marc NNP 10_1101-2021_02_13_429885 212 9 J. J. NNP 10_1101-2021_02_13_429885 212 10 Williams Williams NNP 10_1101-2021_02_13_429885 212 11 , , , 10_1101-2021_02_13_429885 212 12 Luis Luis NNP 10_1101-2021_02_13_429885 212 13 Zapata Zapata NNP 10_1101-2021_02_13_429885 212 14 , , , 10_1101-2021_02_13_429885 212 15 Daniel Daniel NNP 10_1101-2021_02_13_429885 212 16 Nichol Nichol NNP 10_1101-2021_02_13_429885 212 17 , , , 10_1101-2021_02_13_429885 212 18 Ketevan Ketevan NNP 10_1101-2021_02_13_429885 212 19 Chkhaidze Chkhaidze NNP 10_1101-2021_02_13_429885 212 20 , , , 10_1101-2021_02_13_429885 212 21 William William NNP 10_1101-2021_02_13_429885 212 22 Cross Cross NNP 10_1101-2021_02_13_429885 212 23 , , , 10_1101-2021_02_13_429885 212 24 et et NNP 10_1101-2021_02_13_429885 212 25 al al NNP 10_1101-2021_02_13_429885 212 26 . . . 10_1101-2021_02_13_429885 213 1 2020 2020 CD 10_1101-2021_02_13_429885 213 2 . . . 10_1101-2021_02_13_429885 214 1 “ " `` 10_1101-2021_02_13_429885 214 2 Subclonal subclonal JJ 10_1101-2021_02_13_429885 214 3 Reconstruction reconstruction NN 10_1101-2021_02_13_429885 214 4 of of IN 10_1101-2021_02_13_429885 214 5 Tumors Tumors NNPS 10_1101-2021_02_13_429885 214 6 by by IN 10_1101-2021_02_13_429885 214 7 Using use VBG 10_1101-2021_02_13_429885 214 8 Machine Machine NNP 10_1101-2021_02_13_429885 214 9 Learning Learning NNP 10_1101-2021_02_13_429885 214 10 and and CC 10_1101-2021_02_13_429885 214 11 Population Population NNP 10_1101-2021_02_13_429885 214 12 Genetics Genetics NNPS 10_1101-2021_02_13_429885 214 13 . . . 
10_1101-2021_02_13_429885 214 14 ” " '' 10_1101-2021_02_13_429885 214 15 ​Nature ​nature JJ 10_1101-2021_02_13_429885 214 16 Genetics​ genetics​ NN 10_1101-2021_02_13_429885 214 17 52 52 CD 10_1101-2021_02_13_429885 214 18 ( ( -LRB- 10_1101-2021_02_13_429885 214 19 9 9 CD 10_1101-2021_02_13_429885 214 20 ) ) -RRB- 10_1101-2021_02_13_429885 214 21 : : : 10_1101-2021_02_13_429885 214 22 898–907 898–907 CD 10_1101-2021_02_13_429885 214 23 . . . 10_1101-2021_02_13_429885 215 1 https://doi.org/​10.1038/s41588-020-0675-5 https://doi.org/​10.1038/s41588-020-0675-5 CD 10_1101-2021_02_13_429885 215 2 ​. ​. NNP 10_1101-2021_02_13_429885 216 1 Cmero Cmero NNP 10_1101-2021_02_13_429885 216 2 , , , 10_1101-2021_02_13_429885 216 3 Marek Marek NNP 10_1101-2021_02_13_429885 216 4 , , , 10_1101-2021_02_13_429885 216 5 Ke Ke NNP 10_1101-2021_02_13_429885 216 6 Yuan Yuan NNP 10_1101-2021_02_13_429885 216 7 , , , 10_1101-2021_02_13_429885 216 8 Cheng Cheng NNP 10_1101-2021_02_13_429885 216 9 Soon Soon NNP 10_1101-2021_02_13_429885 216 10 Ong Ong NNP 10_1101-2021_02_13_429885 216 11 , , , 10_1101-2021_02_13_429885 216 12 Jan Jan NNP 10_1101-2021_02_13_429885 216 13 Schröder Schröder NNP 10_1101-2021_02_13_429885 216 14 , , , 10_1101-2021_02_13_429885 216 15 Niall Niall NNP 10_1101-2021_02_13_429885 216 16 M. M. NNP 10_1101-2021_02_13_429885 216 17 Corcoran Corcoran NNP 10_1101-2021_02_13_429885 216 18 , , , 10_1101-2021_02_13_429885 216 19 Tony Tony NNP 10_1101-2021_02_13_429885 216 20 Papenfuss Papenfuss NNP 10_1101-2021_02_13_429885 216 21 , , , 10_1101-2021_02_13_429885 216 22 Christopher Christopher NNP 10_1101-2021_02_13_429885 216 23 M. M. 
NNP 10_1101-2021_02_13_429885 216 24 Hovens Hovens NNP 10_1101-2021_02_13_429885 216 25 , , , 10_1101-2021_02_13_429885 216 26 Florian Florian NNP 10_1101-2021_02_13_429885 216 27 Markowetz Markowetz NNP 10_1101-2021_02_13_429885 216 28 , , , 10_1101-2021_02_13_429885 216 29 and and CC 10_1101-2021_02_13_429885 216 30 Geoff Geoff NNP 10_1101-2021_02_13_429885 216 31 Macintyre Macintyre NNP 10_1101-2021_02_13_429885 216 32 . . . 10_1101-2021_02_13_429885 217 1 2020 2020 CD 10_1101-2021_02_13_429885 217 2 . . . 10_1101-2021_02_13_429885 218 1 “ " `` 10_1101-2021_02_13_429885 218 2 Inferring infer VBG 10_1101-2021_02_13_429885 218 3 Structural Structural NNP 10_1101-2021_02_13_429885 218 4 Variant Variant NNP 10_1101-2021_02_13_429885 218 5 Cancer Cancer NNP 10_1101-2021_02_13_429885 218 6 Cell Cell NNP 10_1101-2021_02_13_429885 218 7 Fraction Fraction NNP 10_1101-2021_02_13_429885 218 8 . . . 10_1101-2021_02_13_429885 218 9 ” " '' 10_1101-2021_02_13_429885 218 10 ​Nature ​nature JJ 10_1101-2021_02_13_429885 218 11 Communications​ communications​ NN 10_1101-2021_02_13_429885 218 12 11 11 CD 10_1101-2021_02_13_429885 218 13 ( ( -LRB- 10_1101-2021_02_13_429885 218 14 1 1 CD 10_1101-2021_02_13_429885 218 15 ) ) -RRB- 10_1101-2021_02_13_429885 218 16 : : : 10_1101-2021_02_13_429885 218 17 730 730 CD 10_1101-2021_02_13_429885 218 18 . . . 10_1101-2021_02_13_429885 218 19 https://doi.org/​10.1038/s41467-020-14351-8 https://doi.org/​10.1038/s41467-020-14351-8 ADD 10_1101-2021_02_13_429885 218 20 ​. ​. 
NN 10_1101-2021_02_13_429885 219 1 Cortés Cortés NNP 10_1101-2021_02_13_429885 219 2 - - HYPH 10_1101-2021_02_13_429885 219 3 Ciriano Ciriano NNP 10_1101-2021_02_13_429885 219 4 , , , 10_1101-2021_02_13_429885 219 5 Isidro Isidro NNP 10_1101-2021_02_13_429885 219 6 , , , 10_1101-2021_02_13_429885 219 7 Jake Jake NNP 10_1101-2021_02_13_429885 219 8 June June NNP 10_1101-2021_02_13_429885 219 9 - - HYPH 10_1101-2021_02_13_429885 219 10 Koo Koo NNP 10_1101-2021_02_13_429885 219 11 Lee Lee NNP 10_1101-2021_02_13_429885 219 12 , , , 10_1101-2021_02_13_429885 219 13 Ruibin Ruibin NNP 10_1101-2021_02_13_429885 219 14 Xi Xi NNP 10_1101-2021_02_13_429885 219 15 , , , 10_1101-2021_02_13_429885 219 16 Dhawal Dhawal NNP 10_1101-2021_02_13_429885 219 17 Jain Jain NNP 10_1101-2021_02_13_429885 219 18 , , , 10_1101-2021_02_13_429885 219 19 Youngsook Youngsook NNP 10_1101-2021_02_13_429885 219 20 L. L. NNP 10_1101-2021_02_13_429885 219 21 Jung Jung NNP 10_1101-2021_02_13_429885 219 22 , , , 10_1101-2021_02_13_429885 219 23 Lixing Lixing NNP 10_1101-2021_02_13_429885 219 24 Yang Yang NNP 10_1101-2021_02_13_429885 219 25 , , , 10_1101-2021_02_13_429885 219 26 Dmitry Dmitry NNP 10_1101-2021_02_13_429885 219 27 Gordenin Gordenin NNP 10_1101-2021_02_13_429885 219 28 , , , 10_1101-2021_02_13_429885 219 29 et et NNP 10_1101-2021_02_13_429885 219 30 al al NNP 10_1101-2021_02_13_429885 219 31 . . . 10_1101-2021_02_13_429885 220 1 2020 2020 CD 10_1101-2021_02_13_429885 220 2 . . . 
10_1101-2021_02_13_429885 221 1 “ " `` 10_1101-2021_02_13_429885 221 2 Comprehensive comprehensive JJ 10_1101-2021_02_13_429885 221 3 Analysis Analysis NNP 10_1101-2021_02_13_429885 221 4 of of IN 10_1101-2021_02_13_429885 221 5 Chromothripsis Chromothripsis NNP 10_1101-2021_02_13_429885 221 6 in in IN 10_1101-2021_02_13_429885 221 7 2,658 2,658 CD 10_1101-2021_02_13_429885 221 8 Human Human NNP 10_1101-2021_02_13_429885 221 9 Cancers Cancers NNPS 10_1101-2021_02_13_429885 221 10 Using use VBG 10_1101-2021_02_13_429885 221 11 Whole whole JJ 10_1101-2021_02_13_429885 221 12 - - HYPH 10_1101-2021_02_13_429885 221 13 Genome genome NN 10_1101-2021_02_13_429885 221 14 Sequencing sequencing NN 10_1101-2021_02_13_429885 221 15 . . . 10_1101-2021_02_13_429885 221 16 ” " '' 10_1101-2021_02_13_429885 221 17 ​Nature ​nature JJ 10_1101-2021_02_13_429885 221 18 Genetics​ genetics​ NN 10_1101-2021_02_13_429885 221 19 52 52 CD 10_1101-2021_02_13_429885 221 20 ( ( -LRB- 10_1101-2021_02_13_429885 221 21 3 3 CD 10_1101-2021_02_13_429885 221 22 ) ) -RRB- 10_1101-2021_02_13_429885 221 23 : : : 10_1101-2021_02_13_429885 221 24 331–41 331–41 CD 10_1101-2021_02_13_429885 221 25 . . . 10_1101-2021_02_13_429885 222 1 https://doi.org/​10.1038/s41588-019-0576-7 https://doi.org/​10.1038/s41588-019-0576-7 ADD 10_1101-2021_02_13_429885 222 2 ​. ​. 
NNP 10_1101-2021_02_13_429885 223 1 Cross Cross NNP 10_1101-2021_02_13_429885 223 2 , , , 10_1101-2021_02_13_429885 223 3 William William NNP 10_1101-2021_02_13_429885 223 4 , , , 10_1101-2021_02_13_429885 223 5 Michal Michal NNP 10_1101-2021_02_13_429885 223 6 Kovac Kovac NNP 10_1101-2021_02_13_429885 223 7 , , , 10_1101-2021_02_13_429885 223 8 Ville Ville NNP 10_1101-2021_02_13_429885 223 9 Mustonen Mustonen NNP 10_1101-2021_02_13_429885 223 10 , , , 10_1101-2021_02_13_429885 223 11 Daniel Daniel NNP 10_1101-2021_02_13_429885 223 12 Temko Temko NNP 10_1101-2021_02_13_429885 223 13 , , , 10_1101-2021_02_13_429885 223 14 Hayley Hayley NNP 10_1101-2021_02_13_429885 223 15 Davis Davis NNP 10_1101-2021_02_13_429885 223 16 , , , 10_1101-2021_02_13_429885 223 17 Ann Ann NNP 10_1101-2021_02_13_429885 223 18 - - HYPH 10_1101-2021_02_13_429885 223 19 Marie Marie NNP 10_1101-2021_02_13_429885 223 20 Baker Baker NNP 10_1101-2021_02_13_429885 223 21 , , , 10_1101-2021_02_13_429885 223 22 Sujata Sujata NNP 10_1101-2021_02_13_429885 223 23 Biswas Biswas NNP 10_1101-2021_02_13_429885 223 24 , , , 10_1101-2021_02_13_429885 223 25 et et NNP 10_1101-2021_02_13_429885 223 26 al al NNP 10_1101-2021_02_13_429885 223 27 . . . 10_1101-2021_02_13_429885 224 1 10 10 CD 10_1101-2021_02_13_429885 224 2 2018 2018 CD 10_1101-2021_02_13_429885 224 3 . . . 10_1101-2021_02_13_429885 225 1 “ " `` 10_1101-2021_02_13_429885 225 2 The the DT 10_1101-2021_02_13_429885 225 3 Evolutionary Evolutionary NNP 10_1101-2021_02_13_429885 225 4 Landscape Landscape NNP 10_1101-2021_02_13_429885 225 5 of of IN 10_1101-2021_02_13_429885 225 6 Colorectal Colorectal NNP 10_1101-2021_02_13_429885 225 7 Tumorigenesis Tumorigenesis NNP 10_1101-2021_02_13_429885 225 8 . . . 
10_1101-2021_02_13_429885 225 9 ” " '' 10_1101-2021_02_13_429885 225 10 Nature Nature NNP 10_1101-2021_02_13_429885 225 11 Ecology Ecology NNP 10_1101-2021_02_13_429885 225 12 & & CC 10_1101-2021_02_13_429885 225 13 Evolution​ Evolution​ NNP 10_1101-2021_02_13_429885 225 14 2 2 CD 10_1101-2021_02_13_429885 225 15 ( ( -LRB- 10_1101-2021_02_13_429885 225 16 10 10 CD 10_1101-2021_02_13_429885 225 17 ) ) -RRB- 10_1101-2021_02_13_429885 225 18 : : : 10_1101-2021_02_13_429885 225 19 1661–72 1661–72 CD 10_1101-2021_02_13_429885 225 20 . . . 10_1101-2021_02_13_429885 226 1 https://doi.org/​10.1038/s41559-018-0642-z​. https://doi.org/​10.1038/s41559-018-0642-z​. LS 10_1101-2021_02_13_429885 227 1 Dentro Dentro NNP 10_1101-2021_02_13_429885 227 2 , , , 10_1101-2021_02_13_429885 227 3 Stefan Stefan NNP 10_1101-2021_02_13_429885 227 4 C. C. NNP 10_1101-2021_02_13_429885 227 5 , , , 10_1101-2021_02_13_429885 227 6 David David NNP 10_1101-2021_02_13_429885 227 7 C. C. NNP 10_1101-2021_02_13_429885 227 8 Wedge Wedge NNP 10_1101-2021_02_13_429885 227 9 , , , 10_1101-2021_02_13_429885 227 10 and and CC 10_1101-2021_02_13_429885 227 11 Peter Peter NNP 10_1101-2021_02_13_429885 227 12 Van Van NNP 10_1101-2021_02_13_429885 227 13 Loo Loo NNP 10_1101-2021_02_13_429885 227 14 . . . 10_1101-2021_02_13_429885 228 1 2017 2017 CD 10_1101-2021_02_13_429885 228 2 . . . 10_1101-2021_02_13_429885 229 1 “ " `` 10_1101-2021_02_13_429885 229 2 Principles principle NNS 10_1101-2021_02_13_429885 229 3 of of IN 10_1101-2021_02_13_429885 229 4 Reconstructing reconstruct VBG 10_1101-2021_02_13_429885 229 5 the the DT 10_1101-2021_02_13_429885 229 6 Subclonal Subclonal NNP 10_1101-2021_02_13_429885 229 7 Architecture Architecture NNP 10_1101-2021_02_13_429885 229 8 of of IN 10_1101-2021_02_13_429885 229 9 Cancers Cancers NNPS 10_1101-2021_02_13_429885 229 10 . . . 
10_1101-2021_02_13_429885 229 11 ” " '' 10_1101-2021_02_13_429885 229 12 ​Cold ​Cold NNP 10_1101-2021_02_13_429885 229 13 Spring Spring NNP 10_1101-2021_02_13_429885 229 14 Harbor Harbor NNP 10_1101-2021_02_13_429885 229 15 Perspectives Perspectives NNPS 10_1101-2021_02_13_429885 229 16 in in IN 10_1101-2021_02_13_429885 229 17 Medicine​ Medicine​ NNP 10_1101-2021_02_13_429885 229 18 7 7 CD 10_1101-2021_02_13_429885 229 19 ( ( -LRB- 10_1101-2021_02_13_429885 229 20 8) 8) CD 10_1101-2021_02_13_429885 229 21 . . . 10_1101-2021_02_13_429885 230 1 https://doi.org/​10.1101/cshperspect.a026625 https://doi.org/​10.1101/cshperspect.a026625 UH 10_1101-2021_02_13_429885 230 2 ​. ​. JJ 10_1101-2021_02_13_429885 231 1 Ding Ding NNP 10_1101-2021_02_13_429885 231 2 , , , 10_1101-2021_02_13_429885 231 3 Li Li NNP 10_1101-2021_02_13_429885 231 4 , , , 10_1101-2021_02_13_429885 231 5 Timothy Timothy NNP 10_1101-2021_02_13_429885 231 6 J. J. NNP 10_1101-2021_02_13_429885 231 7 Ley Ley NNP 10_1101-2021_02_13_429885 231 8 , , , 10_1101-2021_02_13_429885 231 9 David David NNP 10_1101-2021_02_13_429885 231 10 E. E. NNP 10_1101-2021_02_13_429885 231 11 Larson Larson NNP 10_1101-2021_02_13_429885 231 12 , , , 10_1101-2021_02_13_429885 231 13 Christopher Christopher NNP 10_1101-2021_02_13_429885 231 14 A. a. NN 10_1101-2021_02_13_429885 231 15 Miller Miller NNP 10_1101-2021_02_13_429885 231 16 , , , 10_1101-2021_02_13_429885 231 17 Daniel Daniel NNP 10_1101-2021_02_13_429885 231 18 C. C. NNP 10_1101-2021_02_13_429885 231 19 Koboldt Koboldt NNP 10_1101-2021_02_13_429885 231 20 , , , 10_1101-2021_02_13_429885 231 21 John John NNP 10_1101-2021_02_13_429885 231 22 S. S. NNP 10_1101-2021_02_13_429885 231 23 Welch Welch NNP 10_1101-2021_02_13_429885 231 24 , , , 10_1101-2021_02_13_429885 231 25 Julie Julie NNP 10_1101-2021_02_13_429885 231 26 K. K. 
NNP 10_1101-2021_02_13_429885 231 27 Ritchey Ritchey NNP 10_1101-2021_02_13_429885 231 28 , , , 10_1101-2021_02_13_429885 231 29 et et NNP 10_1101-2021_02_13_429885 231 30 al al NNP 10_1101-2021_02_13_429885 231 31 . . . 10_1101-2021_02_13_429885 232 1 2012 2012 CD 10_1101-2021_02_13_429885 232 2 . . . 10_1101-2021_02_13_429885 233 1 “ " `` 10_1101-2021_02_13_429885 233 2 Clonal Clonal NNP 10_1101-2021_02_13_429885 233 3 Evolution Evolution NNP 10_1101-2021_02_13_429885 233 4 in in IN 10_1101-2021_02_13_429885 233 5 Relapsed Relapsed NNP 10_1101-2021_02_13_429885 233 6 Acute Acute NNP 10_1101-2021_02_13_429885 233 7 Myeloid Myeloid NNP 10_1101-2021_02_13_429885 233 8 Leukaemia Leukaemia NNP 10_1101-2021_02_13_429885 233 9 Revealed reveal VBN 10_1101-2021_02_13_429885 233 10 by by IN 10_1101-2021_02_13_429885 233 11 Whole whole JJ 10_1101-2021_02_13_429885 233 12 - - HYPH 10_1101-2021_02_13_429885 233 13 Genome genome NN 10_1101-2021_02_13_429885 233 14 Sequencing sequencing NN 10_1101-2021_02_13_429885 233 15 . . . 10_1101-2021_02_13_429885 233 16 ” " '' 10_1101-2021_02_13_429885 233 17 ​Nature​ ​nature​ CD 10_1101-2021_02_13_429885 233 18 481 481 CD 10_1101-2021_02_13_429885 233 19 ( ( -LRB- 10_1101-2021_02_13_429885 233 20 7382 7382 CD 10_1101-2021_02_13_429885 233 21 ) ) -RRB- 10_1101-2021_02_13_429885 233 22 : : : 10_1101-2021_02_13_429885 233 23 506–10 506–10 CD 10_1101-2021_02_13_429885 233 24 . . . 10_1101-2021_02_13_429885 234 1 https://doi.org/​10.1038/nature10738 https://doi.org/​10.1038/nature10738 NNP 10_1101-2021_02_13_429885 234 2 ​. ​. NNP 10_1101-2021_02_13_429885 235 1 Favero Favero NNP 10_1101-2021_02_13_429885 235 2 , , , 10_1101-2021_02_13_429885 235 3 F. F. NNP 10_1101-2021_02_13_429885 235 4 , , , 10_1101-2021_02_13_429885 235 5 T. T. NNP 10_1101-2021_02_13_429885 235 6 Joshi Joshi NNP 10_1101-2021_02_13_429885 235 7 , , , 10_1101-2021_02_13_429885 235 8 A. A. NNP 10_1101-2021_02_13_429885 235 9 M. M. 
NNP 10_1101-2021_02_13_429885 235 10 Marquard Marquard NNP 10_1101-2021_02_13_429885 235 11 , , , 10_1101-2021_02_13_429885 235 12 N. N. NNP 10_1101-2021_02_13_429885 235 13 J. J. NNP 10_1101-2021_02_13_429885 235 14 Birkbak Birkbak NNP 10_1101-2021_02_13_429885 235 15 , , , 10_1101-2021_02_13_429885 235 16 M. M. NNP 10_1101-2021_02_13_429885 235 17 Krzystanek Krzystanek NNP 10_1101-2021_02_13_429885 235 18 , , , 10_1101-2021_02_13_429885 235 19 Q. Q. NNP 10_1101-2021_02_13_429885 235 20 Li Li NNP 10_1101-2021_02_13_429885 235 21 , , , 10_1101-2021_02_13_429885 235 22 Z. Z. NNP 10_1101-2021_02_13_429885 235 23 Szallasi Szallasi NNP 10_1101-2021_02_13_429885 235 24 , , , 10_1101-2021_02_13_429885 235 25 and and CC 10_1101-2021_02_13_429885 235 26 A. A. NNP 10_1101-2021_02_13_429885 235 27 C. C. NNP 10_1101-2021_02_13_429885 235 28 Eklund Eklund NNP 10_1101-2021_02_13_429885 235 29 . . . 10_1101-2021_02_13_429885 236 1 2015 2015 CD 10_1101-2021_02_13_429885 236 2 . . . 10_1101-2021_02_13_429885 237 1 “ " `` 10_1101-2021_02_13_429885 237 2 Sequenza sequenza NN 10_1101-2021_02_13_429885 237 3 : : : 10_1101-2021_02_13_429885 237 4 Allele Allele NNP 10_1101-2021_02_13_429885 237 5 - - HYPH 10_1101-2021_02_13_429885 237 6 Specific Specific NNP 10_1101-2021_02_13_429885 237 7 Copy Copy NNP 10_1101-2021_02_13_429885 237 8 Number number NN 10_1101-2021_02_13_429885 237 9 and and CC 10_1101-2021_02_13_429885 237 10 Mutation Mutation NNP 10_1101-2021_02_13_429885 237 11 Profiles Profiles NNPS 10_1101-2021_02_13_429885 237 12 from from IN 10_1101-2021_02_13_429885 237 13 Tumor Tumor NNP 10_1101-2021_02_13_429885 237 14 Sequencing Sequencing NNP 10_1101-2021_02_13_429885 237 15 Data Data NNPS 10_1101-2021_02_13_429885 237 16 . . . 
10_1101-2021_02_13_429885 237 17 ” " '' 10_1101-2021_02_13_429885 237 18 ​Annals ​annal NNS 10_1101-2021_02_13_429885 237 19 of of IN 10_1101-2021_02_13_429885 237 20 Oncology Oncology NNP 10_1101-2021_02_13_429885 237 21 : : : 10_1101-2021_02_13_429885 237 22 Official Official NNP 10_1101-2021_02_13_429885 237 23 Journal Journal NNP 10_1101-2021_02_13_429885 237 24 of of IN 10_1101-2021_02_13_429885 237 25 the the DT 10_1101-2021_02_13_429885 237 26 European European NNP 10_1101-2021_02_13_429885 237 27 Society Society NNP 10_1101-2021_02_13_429885 237 28 for for IN 10_1101-2021_02_13_429885 237 29 Medical Medical NNP 10_1101-2021_02_13_429885 237 30 Oncology Oncology NNP 10_1101-2021_02_13_429885 237 31 / / SYM 10_1101-2021_02_13_429885 237 32 ESMO​ ESMO​ NNP 10_1101-2021_02_13_429885 237 33 26 26 CD 10_1101-2021_02_13_429885 237 34 ( ( -LRB- 10_1101-2021_02_13_429885 237 35 1 1 CD 10_1101-2021_02_13_429885 237 36 ) ) -RRB- 10_1101-2021_02_13_429885 237 37 : : : 10_1101-2021_02_13_429885 237 38 64–70 64–70 LS 10_1101-2021_02_13_429885 237 39 . . . 10_1101-2021_02_13_429885 238 1 https://doi.org/​10.1093/annonc/mdu479 https://doi.org/​10.1093/annonc/mdu479 PRP 10_1101-2021_02_13_429885 238 2 ​. ​. NNP 10_1101-2021_02_13_429885 239 1 Fischer Fischer NNP 10_1101-2021_02_13_429885 239 2 , , , 10_1101-2021_02_13_429885 239 3 Andrej Andrej NNP 10_1101-2021_02_13_429885 239 4 , , , 10_1101-2021_02_13_429885 239 5 Ignacio Ignacio NNP 10_1101-2021_02_13_429885 239 6 Vázquez Vázquez NNP 10_1101-2021_02_13_429885 239 7 - - HYPH 10_1101-2021_02_13_429885 239 8 García García NNP 10_1101-2021_02_13_429885 239 9 , , , 10_1101-2021_02_13_429885 239 10 Christopher Christopher NNP 10_1101-2021_02_13_429885 239 11 J. J. NNP 10_1101-2021_02_13_429885 239 12 R. R. 
NNP 10_1101-2021_02_13_429885 239 13 Illingworth Illingworth NNP 10_1101-2021_02_13_429885 239 14 , , , 10_1101-2021_02_13_429885 239 15 and and CC 10_1101-2021_02_13_429885 239 16 Ville Ville NNP 10_1101-2021_02_13_429885 239 17 Mustonen Mustonen NNP 10_1101-2021_02_13_429885 239 18 . . . 
10_1101-2021_02_13_429885 241 84 http://paperpile.com/b/rqVmzs/FjZP http://paperpile.com/b/rqvmzs/fjzp ADD 10_1101-2021_02_13_429885 241 85 http://paperpile.com/b/rqVmzs/FjZP http://paperpile.com/b/rqvmzs/fjzp UH 10_1101-2021_02_13_429885 241 86 http://dx.doi.org/10.1038/s41588-019-0576-7 http://dx.doi.org/10.1038/s41588-019-0576-7 NNP 10_1101-2021_02_13_429885 241 87 http://paperpile.com/b/rqVmzs/FjZP http://paperpile.com/b/rqVmzs/FjZP NNP 10_1101-2021_02_13_429885 241 88 http://paperpile.com/b/rqVmzs/IC0y http://paperpile.com/b/rqVmzs/IC0y NNP 10_1101-2021_02_13_429885 241 89 http://paperpile.com/b/rqVmzs/IC0y http://paperpile.com/b/rqVmzs/IC0y NNP 10_1101-2021_02_13_429885 241 90 http://paperpile.com/b/rqVmzs/IC0y http://paperpile.com/b/rqVmzs/IC0y NNP 10_1101-2021_02_13_429885 241 91 http://paperpile.com/b/rqVmzs/IC0y http://paperpile.com/b/rqVmzs/IC0y NNP 10_1101-2021_02_13_429885 241 92 http://dx.doi.org/10.1038/s41559-018-0642-z http://dx.doi.org/10.1038/s41559-018-0642-z NNP 10_1101-2021_02_13_429885 241 93 http://paperpile.com/b/rqVmzs/IC0y http://paperpile.com/b/rqVmzs/IC0y NNP 10_1101-2021_02_13_429885 241 94 http://paperpile.com/b/rqVmzs/Uxwc http://paperpile.com/b/rqVmzs/Uxwc NNP 10_1101-2021_02_13_429885 241 95 http://paperpile.com/b/rqVmzs/Uxwc http://paperpile.com/b/rqvmzs/uxwc JJ 10_1101-2021_02_13_429885 241 96 http://paperpile.com/b/rqVmzs/Uxwc http://paperpile.com/b/rqVmzs/Uxwc NNP 10_1101-2021_02_13_429885 241 97 http://paperpile.com/b/rqVmzs/Uxwc http://paperpile.com/b/rqvmzs/uxwc JJ 10_1101-2021_02_13_429885 241 98 http://paperpile.com/b/rqVmzs/Uxwc http://paperpile.com/b/rqVmzs/Uxwc NNP 10_1101-2021_02_13_429885 241 99 http://dx.doi.org/10.1101/cshperspect.a026625 http://dx.doi.org/10.1101/cshperspect.a026625 NNP 10_1101-2021_02_13_429885 241 100 http://paperpile.com/b/rqVmzs/Uxwc http://paperpile.com/b/rqVmzs/Uxwc NNP 10_1101-2021_02_13_429885 241 101 http://paperpile.com/b/rqVmzs/wPG3 http://paperpile.com/b/rqvmzs/wpg3 CD 
10_1101-2021_02_13_429885 241 102 http://paperpile.com/b/rqVmzs/wPG3 http://paperpile.com/b/rqvmzs/wpg3 CD 10_1101-2021_02_13_429885 241 103 http://paperpile.com/b/rqVmzs/wPG3 http://paperpile.com/b/rqvmzs/wpg3 CD 10_1101-2021_02_13_429885 241 104 http://paperpile.com/b/rqVmzs/wPG3 http://paperpile.com/b/rqvmzs/wpg3 CD 10_1101-2021_02_13_429885 241 105 http://paperpile.com/b/rqVmzs/wPG3 http://paperpile.com/b/rqvmzs/wpg3 CD 10_1101-2021_02_13_429885 241 106 http://paperpile.com/b/rqVmzs/wPG3 http://paperpile.com/b/rqvmzs/wpg3 CD 10_1101-2021_02_13_429885 241 107 http://dx.doi.org/10.1038/nature10738 http://dx.doi.org/10.1038/nature10738 NNP 10_1101-2021_02_13_429885 241 108 http://paperpile.com/b/rqVmzs/wPG3 http://paperpile.com/b/rqvmzs/wpg3 CD 10_1101-2021_02_13_429885 241 109 http://paperpile.com/b/rqVmzs/tCb6 http://paperpile.com/b/rqvmzs/tcb6 UH 10_1101-2021_02_13_429885 241 110 http://paperpile.com/b/rqVmzs/tCb6 http://paperpile.com/b/rqvmzs/tcb6 UH 10_1101-2021_02_13_429885 241 111 http://paperpile.com/b/rqVmzs/tCb6 http://paperpile.com/b/rqvmzs/tcb6 UH 10_1101-2021_02_13_429885 241 112 http://paperpile.com/b/rqVmzs/tCb6 http://paperpile.com/b/rqvmzs/tcb6 UH 10_1101-2021_02_13_429885 241 113 http://paperpile.com/b/rqVmzs/tCb6 http://paperpile.com/b/rqvmzs/tcb6 UH 10_1101-2021_02_13_429885 241 114 http://paperpile.com/b/rqVmzs/tCb6 http://paperpile.com/b/rqVmzs/tCb6 NNP 10_1101-2021_02_13_429885 241 115 http://dx.doi.org/10.1093/annonc/mdu479 http://dx.doi.org/10.1093/annonc/mdu479 NNP 10_1101-2021_02_13_429885 241 116 http://paperpile.com/b/rqVmzs/tCb6 http://paperpile.com/b/rqVmzs/tCb6 NNP 10_1101-2021_02_13_429885 241 117 http://paperpile.com/b/rqVmzs/A7Vg http://paperpile.com/b/rqVmzs/A7Vg NNP 10_1101-2021_02_13_429885 241 118 https://doi.org/10.1101/2021.02.13.429885 https://doi.org/10.1101/2021.02.13.429885 ADD 10_1101-2021_02_13_429885 241 119 http://creativecommons.org/licenses/by-nc-nd/4.0/ http://creativecommons.org/licenses/by-nc-nd/4.0/ CD 
10_1101-2021_02_13_429885 243 1 2014 2014 CD 10_1101-2021_02_13_429885 243 2 . . . 10_1101-2021_02_13_429885 244 1 “ " `` 10_1101-2021_02_13_429885 244 2 High High NNP 10_1101-2021_02_13_429885 244 3 - - HYPH 10_1101-2021_02_13_429885 244 4 Definition Definition NNP 10_1101-2021_02_13_429885 244 5 Reconstruction Reconstruction NNP 10_1101-2021_02_13_429885 244 6 of of IN 10_1101-2021_02_13_429885 244 7 Clonal Clonal NNP 10_1101-2021_02_13_429885 244 8 Composition Composition NNP 10_1101-2021_02_13_429885 244 9 in in IN 10_1101-2021_02_13_429885 244 10 Cancer Cancer NNP 10_1101-2021_02_13_429885 244 11 . . . 
10_1101-2021_02_13_429885 244 12 ” " '' 10_1101-2021_02_13_429885 244 13 Cell Cell NNP 10_1101-2021_02_13_429885 244 14 Reports Reports NNP 10_1101-2021_02_13_429885 244 15 7 7 CD 10_1101-2021_02_13_429885 244 16 ( ( -LRB- 10_1101-2021_02_13_429885 244 17 5 5 CD 10_1101-2021_02_13_429885 244 18 ) ) -RRB- 10_1101-2021_02_13_429885 244 19 : : : 10_1101-2021_02_13_429885 244 20 1740–52 1740–52 LS 10_1101-2021_02_13_429885 244 21 . . . 10_1101-2021_02_13_429885 245 1 https://doi.org/10.1016/j.celrep.2014.04.055 https://doi.org/10.1016/j.celrep.2014.04.055 NNP 10_1101-2021_02_13_429885 245 2 . . NNP 10_1101-2021_02_13_429885 246 1 Gerstung Gerstung NNP 10_1101-2021_02_13_429885 246 2 , , , 10_1101-2021_02_13_429885 246 3 Moritz Moritz NNP 10_1101-2021_02_13_429885 246 4 , , , 10_1101-2021_02_13_429885 246 5 Clemency Clemency NNP 10_1101-2021_02_13_429885 246 6 Jolly Jolly NNP 10_1101-2021_02_13_429885 246 7 , , , 10_1101-2021_02_13_429885 246 8 Ignaty Ignaty NNP 10_1101-2021_02_13_429885 246 9 Leshchiner Leshchiner NNP 10_1101-2021_02_13_429885 246 10 , , , 10_1101-2021_02_13_429885 246 11 Stefan Stefan NNP 10_1101-2021_02_13_429885 246 12 C. C. NNP 10_1101-2021_02_13_429885 246 13 Dentro Dentro NNP 10_1101-2021_02_13_429885 246 14 , , , 10_1101-2021_02_13_429885 246 15 Santiago Santiago NNP 10_1101-2021_02_13_429885 246 16 Gonzalez Gonzalez NNP 10_1101-2021_02_13_429885 246 17 , , , 10_1101-2021_02_13_429885 246 18 Daniel Daniel NNP 10_1101-2021_02_13_429885 246 19 Rosebrock Rosebrock NNP 10_1101-2021_02_13_429885 246 20 , , , 10_1101-2021_02_13_429885 246 21 Thomas Thomas NNP 10_1101-2021_02_13_429885 246 22 J. J. NNP 10_1101-2021_02_13_429885 246 23 Mitchell Mitchell NNP 10_1101-2021_02_13_429885 246 24 , , , 10_1101-2021_02_13_429885 246 25 et et NNP 10_1101-2021_02_13_429885 246 26 al al NNP 10_1101-2021_02_13_429885 246 27 . . . 10_1101-2021_02_13_429885 247 1 2020 2020 CD 10_1101-2021_02_13_429885 247 2 . . . 
10_1101-2021_02_13_429885 248 1 “ " `` 10_1101-2021_02_13_429885 248 2 The the DT 10_1101-2021_02_13_429885 248 3 Evolutionary Evolutionary NNP 10_1101-2021_02_13_429885 248 4 History history NN 10_1101-2021_02_13_429885 248 5 of of IN 10_1101-2021_02_13_429885 248 6 2,658 2,658 CD 10_1101-2021_02_13_429885 248 7 Cancers Cancers NNPS 10_1101-2021_02_13_429885 248 8 . . . 10_1101-2021_02_13_429885 248 9 ” " '' 10_1101-2021_02_13_429885 248 10 Nature Nature NNP 10_1101-2021_02_13_429885 248 11 578 578 CD 10_1101-2021_02_13_429885 248 12 ( ( -LRB- 10_1101-2021_02_13_429885 248 13 7793 7793 CD 10_1101-2021_02_13_429885 248 14 ) ) -RRB- 10_1101-2021_02_13_429885 248 15 : : : 10_1101-2021_02_13_429885 248 16 122–28 122–28 CD 10_1101-2021_02_13_429885 248 17 . . . 10_1101-2021_02_13_429885 249 1 https://doi.org/10.1038/s41586-019-1907-7 https://doi.org/10.1038/s41586-019-1907-7 NNP 10_1101-2021_02_13_429885 249 2 . . NNP 10_1101-2021_02_13_429885 250 1 Gonzalez Gonzalez NNP 10_1101-2021_02_13_429885 250 2 - - HYPH 10_1101-2021_02_13_429885 250 3 Perez Perez NNP 10_1101-2021_02_13_429885 250 4 , , , 10_1101-2021_02_13_429885 250 5 Abel Abel NNP 10_1101-2021_02_13_429885 250 6 , , , 10_1101-2021_02_13_429885 250 7 Christian Christian NNP 10_1101-2021_02_13_429885 250 8 Perez Perez NNP 10_1101-2021_02_13_429885 250 9 - - HYPH 10_1101-2021_02_13_429885 250 10 Llamas Llamas NNP 10_1101-2021_02_13_429885 250 11 , , , 10_1101-2021_02_13_429885 250 12 Jordi Jordi NNP 10_1101-2021_02_13_429885 250 13 Deu Deu NNP 10_1101-2021_02_13_429885 250 14 - - HYPH 10_1101-2021_02_13_429885 250 15 Pons Pons NNP 10_1101-2021_02_13_429885 250 16 , , , 10_1101-2021_02_13_429885 250 17 David David NNP 10_1101-2021_02_13_429885 250 18 Tamborero Tamborero NNP 10_1101-2021_02_13_429885 250 19 , , , 10_1101-2021_02_13_429885 250 20 Michael Michael NNP 10_1101-2021_02_13_429885 250 21 P. P. 
NNP 10_1101-2021_02_13_429885 250 22 Schroeder Schroeder NNP 10_1101-2021_02_13_429885 250 23 , , , 10_1101-2021_02_13_429885 250 24 Alba Alba NNP 10_1101-2021_02_13_429885 250 25 Jene Jene NNP 10_1101-2021_02_13_429885 250 26 - - HYPH 10_1101-2021_02_13_429885 250 27 Sanz Sanz NNP 10_1101-2021_02_13_429885 250 28 , , , 10_1101-2021_02_13_429885 250 29 Alberto Alberto NNP 10_1101-2021_02_13_429885 250 30 Santos Santos NNP 10_1101-2021_02_13_429885 250 31 , , , 10_1101-2021_02_13_429885 250 32 and and CC 10_1101-2021_02_13_429885 250 33 Nuria Nuria NNP 10_1101-2021_02_13_429885 250 34 Lopez Lopez NNP 10_1101-2021_02_13_429885 250 35 - - HYPH 10_1101-2021_02_13_429885 250 36 Bigas Bigas NNP 10_1101-2021_02_13_429885 250 37 . . . 10_1101-2021_02_13_429885 251 1 2013 2013 CD 10_1101-2021_02_13_429885 251 2 . . . 10_1101-2021_02_13_429885 252 1 “ " `` 10_1101-2021_02_13_429885 252 2 IntOGen IntOGen NNP 10_1101-2021_02_13_429885 252 3 - - HYPH 10_1101-2021_02_13_429885 252 4 Mutations Mutations NNP 10_1101-2021_02_13_429885 252 5 Identifies Identifies NNPS 10_1101-2021_02_13_429885 252 6 Cancer Cancer NNP 10_1101-2021_02_13_429885 252 7 Drivers Drivers NNPS 10_1101-2021_02_13_429885 252 8 across across IN 10_1101-2021_02_13_429885 252 9 Tumor Tumor NNP 10_1101-2021_02_13_429885 252 10 Types Types NNP 10_1101-2021_02_13_429885 252 11 . . . 10_1101-2021_02_13_429885 252 12 ” " '' 10_1101-2021_02_13_429885 252 13 Nature nature JJ 10_1101-2021_02_13_429885 252 14 Methods Methods NNP 10_1101-2021_02_13_429885 252 15 10 10 CD 10_1101-2021_02_13_429885 252 16 ( ( -LRB- 10_1101-2021_02_13_429885 252 17 11 11 CD 10_1101-2021_02_13_429885 252 18 ) ) -RRB- 10_1101-2021_02_13_429885 252 19 : : : 10_1101-2021_02_13_429885 252 20 1081–82 1081–82 LS 10_1101-2021_02_13_429885 252 21 . . . 10_1101-2021_02_13_429885 253 1 https://doi.org/10.1038/nmeth.2642 https://doi.org/10.1038/nmeth.2642 RB 10_1101-2021_02_13_429885 253 2 . . 
JJ 10_1101-2021_02_13_429885 254 1 Greaves Greaves NNP 10_1101-2021_02_13_429885 254 2 , , , 10_1101-2021_02_13_429885 254 3 Mel Mel NNP 10_1101-2021_02_13_429885 254 4 , , , 10_1101-2021_02_13_429885 254 5 and and CC 10_1101-2021_02_13_429885 254 6 Carlo Carlo NNP 10_1101-2021_02_13_429885 254 7 C. C. NNP 10_1101-2021_02_13_429885 254 8 Maley Maley NNP 10_1101-2021_02_13_429885 254 9 . . . 10_1101-2021_02_13_429885 255 1 2012 2012 CD 10_1101-2021_02_13_429885 255 2 . . . 10_1101-2021_02_13_429885 256 1 “ " `` 10_1101-2021_02_13_429885 256 2 Clonal Clonal NNP 10_1101-2021_02_13_429885 256 3 Evolution Evolution NNP 10_1101-2021_02_13_429885 256 4 in in IN 10_1101-2021_02_13_429885 256 5 Cancer Cancer NNP 10_1101-2021_02_13_429885 256 6 . . . 10_1101-2021_02_13_429885 256 7 ” " '' 10_1101-2021_02_13_429885 256 8 Nature nature CD 10_1101-2021_02_13_429885 256 9 481 481 CD 10_1101-2021_02_13_429885 256 10 ( ( -LRB- 10_1101-2021_02_13_429885 256 11 7381 7381 CD 10_1101-2021_02_13_429885 256 12 ) ) -RRB- 10_1101-2021_02_13_429885 256 13 : : : 10_1101-2021_02_13_429885 256 14 306–13 306–13 CD 10_1101-2021_02_13_429885 256 15 . . . 10_1101-2021_02_13_429885 257 1 https://doi.org/10.1038/nature10762 https://doi.org/10.1038/nature10762 NNS 10_1101-2021_02_13_429885 257 2 . . JJ 10_1101-2021_02_13_429885 258 1 Jamal Jamal NNP 10_1101-2021_02_13_429885 258 2 - - HYPH 10_1101-2021_02_13_429885 258 3 Hanjani Hanjani NNP 10_1101-2021_02_13_429885 258 4 , , , 10_1101-2021_02_13_429885 258 5 Mariam Mariam NNP 10_1101-2021_02_13_429885 258 6 , , , 10_1101-2021_02_13_429885 258 7 Gareth Gareth NNP 10_1101-2021_02_13_429885 258 8 A. A. NNP 10_1101-2021_02_13_429885 258 9 Wilson Wilson NNP 10_1101-2021_02_13_429885 258 10 , , , 10_1101-2021_02_13_429885 258 11 Nicholas Nicholas NNP 10_1101-2021_02_13_429885 258 12 McGranahan McGranahan NNP 10_1101-2021_02_13_429885 258 13 , , , 10_1101-2021_02_13_429885 258 14 Nicolai Nicolai NNP 10_1101-2021_02_13_429885 258 15 J. J. 
NNP 10_1101-2021_02_13_429885 258 16 Birkbak Birkbak NNP 10_1101-2021_02_13_429885 258 17 , , , 10_1101-2021_02_13_429885 258 18 Thomas Thomas NNP 10_1101-2021_02_13_429885 258 19 B. B. NNP 10_1101-2021_02_13_429885 258 20 K. K. NNP 10_1101-2021_02_13_429885 258 21 Watkins Watkins NNP 10_1101-2021_02_13_429885 258 22 , , , 10_1101-2021_02_13_429885 258 23 Selvaraju Selvaraju NNP 10_1101-2021_02_13_429885 258 24 Veeriah Veeriah NNP 10_1101-2021_02_13_429885 258 25 , , , 10_1101-2021_02_13_429885 258 26 Seema Seema NNP 10_1101-2021_02_13_429885 258 27 Shafi Shafi NNP 10_1101-2021_02_13_429885 258 28 , , , 10_1101-2021_02_13_429885 258 29 et et NNP 10_1101-2021_02_13_429885 258 30 al al NNP 10_1101-2021_02_13_429885 258 31 . . . 10_1101-2021_02_13_429885 259 1 2017 2017 CD 10_1101-2021_02_13_429885 259 2 . . . 10_1101-2021_02_13_429885 260 1 “ " `` 10_1101-2021_02_13_429885 260 2 Tracking track VBG 10_1101-2021_02_13_429885 260 3 the the DT 10_1101-2021_02_13_429885 260 4 Evolution Evolution NNP 10_1101-2021_02_13_429885 260 5 of of IN 10_1101-2021_02_13_429885 260 6 Non non JJ 10_1101-2021_02_13_429885 260 7 - - JJ 10_1101-2021_02_13_429885 260 8 Small small JJ 10_1101-2021_02_13_429885 260 9 - - HYPH 10_1101-2021_02_13_429885 260 10 Cell Cell NNP 10_1101-2021_02_13_429885 260 11 Lung Lung NNP 10_1101-2021_02_13_429885 260 12 Cancer Cancer NNP 10_1101-2021_02_13_429885 260 13 . . . 
10_1101-2021_02_13_429885 260 14 ” " '' 10_1101-2021_02_13_429885 260 15 The The NNP 10_1101-2021_02_13_429885 260 16 New New NNP 10_1101-2021_02_13_429885 260 17 England England NNP 10_1101-2021_02_13_429885 260 18 Journal Journal NNP 10_1101-2021_02_13_429885 260 19 of of IN 10_1101-2021_02_13_429885 260 20 Medicine Medicine NNP 10_1101-2021_02_13_429885 260 21 376 376 CD 10_1101-2021_02_13_429885 260 22 ( ( -LRB- 10_1101-2021_02_13_429885 260 23 22 22 CD 10_1101-2021_02_13_429885 260 24 ) ) -RRB- 10_1101-2021_02_13_429885 260 25 : : : 10_1101-2021_02_13_429885 260 26 2109–21 2109–21 LS 10_1101-2021_02_13_429885 260 27 . . . 10_1101-2021_02_13_429885 261 1 https://doi.org/10.1056/NEJMoa1616288 https://doi.org/10.1056/nejmoa1616288 CD 10_1101-2021_02_13_429885 261 2 . . CD 10_1101-2021_02_13_429885 262 1 Kent Kent NNP 10_1101-2021_02_13_429885 262 2 , , , 10_1101-2021_02_13_429885 262 3 David David NNP 10_1101-2021_02_13_429885 262 4 G. G. NNP 10_1101-2021_02_13_429885 262 5 , , , 10_1101-2021_02_13_429885 262 6 and and CC 10_1101-2021_02_13_429885 262 7 Anthony Anthony NNP 10_1101-2021_02_13_429885 262 8 R. R. NNP 10_1101-2021_02_13_429885 262 9 Green Green NNP 10_1101-2021_02_13_429885 262 10 . . . 10_1101-2021_02_13_429885 263 1 2017 2017 CD 10_1101-2021_02_13_429885 263 2 - - SYM 10_1101-2021_02_13_429885 263 3 4 4 CD 10_1101-2021_02_13_429885 263 4 . . . 
10_1101-2021_02_13_429885 264 1 “ " `` 10_1101-2021_02_13_429885 264 2 Order order NN 10_1101-2021_02_13_429885 264 3 Matters matter NNS 10_1101-2021_02_13_429885 264 4 : : : 10_1101-2021_02_13_429885 264 5 The the DT 10_1101-2021_02_13_429885 264 6 Order order NN 10_1101-2021_02_13_429885 264 7 of of IN 10_1101-2021_02_13_429885 264 8 Somatic somatic JJ 10_1101-2021_02_13_429885 264 9 Mutations Mutations NNPS 10_1101-2021_02_13_429885 264 10 Influences Influences NNPS 10_1101-2021_02_13_429885 264 11 Cancer Cancer NNP 10_1101-2021_02_13_429885 264 12 Evolution Evolution NNP 10_1101-2021_02_13_429885 264 13 . . . 10_1101-2021_02_13_429885 264 14 ” " '' 10_1101-2021_02_13_429885 264 15 Cold Cold `` 10_1101-2021_02_13_429885 264 16 Spring Spring NNP 10_1101-2021_02_13_429885 264 17 Harbor Harbor NNP 10_1101-2021_02_13_429885 264 18 Perspectives Perspectives NNPS 10_1101-2021_02_13_429885 264 19 in in IN 10_1101-2021_02_13_429885 264 20 Medicine Medicine NNP 10_1101-2021_02_13_429885 264 21 7 7 CD 10_1101-2021_02_13_429885 264 22 ( ( -LRB- 10_1101-2021_02_13_429885 264 23 4 4 CD 10_1101-2021_02_13_429885 264 24 ) ) -RRB- 10_1101-2021_02_13_429885 264 25 . . . 10_1101-2021_02_13_429885 265 1 https://doi.org/10.1101/cshperspect.a027060 https://doi.org/10.1101/cshperspect.a027060 NNP 10_1101-2021_02_13_429885 265 2 . . CD 10_1101-2021_02_13_429885 266 1 Landau Landau NNS 10_1101-2021_02_13_429885 266 2 , , , 10_1101-2021_02_13_429885 266 3 Dan Dan NNP 10_1101-2021_02_13_429885 266 4 A. A. NNP 10_1101-2021_02_13_429885 266 5 , , , 10_1101-2021_02_13_429885 266 6 Scott Scott NNP 10_1101-2021_02_13_429885 266 7 L. L. 
NNP 10_1101-2021_02_13_429885 266 8 Carter Carter NNP 10_1101-2021_02_13_429885 266 9 , , , 10_1101-2021_02_13_429885 266 10 Petar Petar NNP 10_1101-2021_02_13_429885 266 11 Stojanov Stojanov NNP 10_1101-2021_02_13_429885 266 12 , , , 10_1101-2021_02_13_429885 266 13 Aaron Aaron NNP 10_1101-2021_02_13_429885 266 14 McKenna McKenna NNP 10_1101-2021_02_13_429885 266 15 , , , 10_1101-2021_02_13_429885 266 16 Kristen Kristen NNP 10_1101-2021_02_13_429885 266 17 Stevenson Stevenson NNP 10_1101-2021_02_13_429885 266 18 , , , 10_1101-2021_02_13_429885 266 19 Michael Michael NNP 10_1101-2021_02_13_429885 266 20 S. S. NNP 10_1101-2021_02_13_429885 266 21 Lawrence Lawrence NNP 10_1101-2021_02_13_429885 266 22 , , , 10_1101-2021_02_13_429885 266 23 Carrie Carrie NNP 10_1101-2021_02_13_429885 266 24 Sougnez Sougnez NNP 10_1101-2021_02_13_429885 266 25 , , , 10_1101-2021_02_13_429885 266 26 et et NNP 10_1101-2021_02_13_429885 266 27 al al NNP 10_1101-2021_02_13_429885 266 28 . . . 10_1101-2021_02_13_429885 267 1 2013 2013 CD 10_1101-2021_02_13_429885 267 2 . . . 10_1101-2021_02_13_429885 268 1 “ " `` 10_1101-2021_02_13_429885 268 2 Evolution Evolution NNP 10_1101-2021_02_13_429885 268 3 and and CC 10_1101-2021_02_13_429885 268 4 Impact Impact NNP 10_1101-2021_02_13_429885 268 5 of of IN 10_1101-2021_02_13_429885 268 6 Subclonal Subclonal NNP 10_1101-2021_02_13_429885 268 7 Mutations Mutations NNPS 10_1101-2021_02_13_429885 268 8 in in IN 10_1101-2021_02_13_429885 268 9 Chronic Chronic NNP 10_1101-2021_02_13_429885 268 10 Lymphocytic Lymphocytic NNP 10_1101-2021_02_13_429885 268 11 Leukemia Leukemia NNP 10_1101-2021_02_13_429885 268 12 . . . 
10_1101-2021_02_13_429885 268 13 ” " '' 10_1101-2021_02_13_429885 268 14 Cell Cell NNP 10_1101-2021_02_13_429885 268 15 152 152 CD 10_1101-2021_02_13_429885 268 16 ( ( -LRB- 10_1101-2021_02_13_429885 268 17 4 4 CD 10_1101-2021_02_13_429885 268 18 ) ) -RRB- 10_1101-2021_02_13_429885 268 19 : : : 10_1101-2021_02_13_429885 268 20 714–26 714–26 CD 10_1101-2021_02_13_429885 268 21 . . . 10_1101-2021_02_13_429885 269 1 https://doi.org/10.1016/j.cell.2013.01.019 https://doi.org/10.1016/j.cell.2013.01.019 NNP 10_1101-2021_02_13_429885 269 2 . . CD 10_1101-2021_02_13_429885 270 1 Levine Levine NNP 10_1101-2021_02_13_429885 270 2 , , , 10_1101-2021_02_13_429885 270 3 Arnold Arnold NNP 10_1101-2021_02_13_429885 270 4 J. J. NNP 10_1101-2021_02_13_429885 270 5 , , , 10_1101-2021_02_13_429885 270 6 Nancy Nancy NNP 10_1101-2021_02_13_429885 270 7 A. A. NNP 10_1101-2021_02_13_429885 270 8 Jenkins Jenkins NNP 10_1101-2021_02_13_429885 270 9 , , , 10_1101-2021_02_13_429885 270 10 and and CC 10_1101-2021_02_13_429885 270 11 Neal Neal NNP 10_1101-2021_02_13_429885 270 12 G. G. NNP 10_1101-2021_02_13_429885 270 13 Copeland Copeland NNP 10_1101-2021_02_13_429885 270 14 . . . 10_1101-2021_02_13_429885 271 1 2019 2019 CD 10_1101-2021_02_13_429885 271 2 . . . 
10_1101-2021_02_13_429885 272 1 “ " `` 10_1101-2021_02_13_429885 272 2 The the DT 10_1101-2021_02_13_429885 272 3 Roles Roles NNPS 10_1101-2021_02_13_429885 272 4 of of IN 10_1101-2021_02_13_429885 272 5 Initiating Initiating NNP 10_1101-2021_02_13_429885 272 6 Truncal Truncal NNP 10_1101-2021_02_13_429885 272 7 Mutations Mutations NNPS 10_1101-2021_02_13_429885 272 8 in in IN 10_1101-2021_02_13_429885 272 9 Human Human NNP 10_1101-2021_02_13_429885 272 10 Cancers Cancers NNPS 10_1101-2021_02_13_429885 272 11 : : : 10_1101-2021_02_13_429885 272 12 The the DT 10_1101-2021_02_13_429885 272 13 Order order NN 10_1101-2021_02_13_429885 272 14 of of IN 10_1101-2021_02_13_429885 272 15 Mutations Mutations NNPS 10_1101-2021_02_13_429885 272 16 and and CC 10_1101-2021_02_13_429885 272 17 Tumor Tumor NNP 10_1101-2021_02_13_429885 272 18 Cell Cell NNP 10_1101-2021_02_13_429885 272 19 Type Type NNP 10_1101-2021_02_13_429885 272 20 Matters Matters NNPS 10_1101-2021_02_13_429885 272 21 . . . 10_1101-2021_02_13_429885 272 22 ” " '' 10_1101-2021_02_13_429885 272 23 Cancer cancer NN 10_1101-2021_02_13_429885 272 24 Cell cell CD 10_1101-2021_02_13_429885 272 25 35 35 CD 10_1101-2021_02_13_429885 272 26 ( ( -LRB- 10_1101-2021_02_13_429885 272 27 1 1 CD 10_1101-2021_02_13_429885 272 28 ) ) -RRB- 10_1101-2021_02_13_429885 272 29 : : : 10_1101-2021_02_13_429885 272 30 10–15 10–15 LS 10_1101-2021_02_13_429885 272 31 . . . 10_1101-2021_02_13_429885 273 1 https://doi.org/10.1016/j.ccell.2018.11.009 https://doi.org/10.1016/j.ccell.2018.11.009 NNP 10_1101-2021_02_13_429885 273 2 . . NNP 10_1101-2021_02_13_429885 274 1 Li Li NNP 10_1101-2021_02_13_429885 274 2 , , , 10_1101-2021_02_13_429885 274 3 Yilong Yilong NNP 10_1101-2021_02_13_429885 274 4 , , , 10_1101-2021_02_13_429885 274 5 Nicola Nicola NNP 10_1101-2021_02_13_429885 274 6 D. D. 
NNP 10_1101-2021_02_13_429885 274 7 Roberts Roberts NNP 10_1101-2021_02_13_429885 274 8 , , , 10_1101-2021_02_13_429885 274 9 Jeremiah Jeremiah NNP 10_1101-2021_02_13_429885 274 10 A. a. NN 10_1101-2021_02_13_429885 274 11 Wala Wala NNP 10_1101-2021_02_13_429885 274 12 , , , 10_1101-2021_02_13_429885 274 13 Ofer Ofer NNP 10_1101-2021_02_13_429885 274 14 Shapira Shapira NNP 10_1101-2021_02_13_429885 274 15 , , , 10_1101-2021_02_13_429885 274 16 Steven Steven NNP 10_1101-2021_02_13_429885 274 17 E. E. NNP 10_1101-2021_02_13_429885 274 18 Schumacher Schumacher NNP 10_1101-2021_02_13_429885 274 19 , , , 10_1101-2021_02_13_429885 274 20 Kiran Kiran NNP 10_1101-2021_02_13_429885 274 21 Kumar Kumar NNP 10_1101-2021_02_13_429885 274 22 , , , 10_1101-2021_02_13_429885 274 23 Ekta Ekta NNP 10_1101-2021_02_13_429885 274 24 Khurana Khurana NNP 10_1101-2021_02_13_429885 274 25 , , , 10_1101-2021_02_13_429885 274 26 et et NNP 10_1101-2021_02_13_429885 274 27 al al NNP 10_1101-2021_02_13_429885 274 28 . . . 10_1101-2021_02_13_429885 275 1 2020 2020 CD 10_1101-2021_02_13_429885 275 2 . . . 10_1101-2021_02_13_429885 276 1 “ " `` 10_1101-2021_02_13_429885 276 2 Patterns Patterns NNPS 10_1101-2021_02_13_429885 276 3 of of IN 10_1101-2021_02_13_429885 276 4 Somatic Somatic NNP 10_1101-2021_02_13_429885 276 5 Structural Structural NNP 10_1101-2021_02_13_429885 276 6 Variation Variation NNP 10_1101-2021_02_13_429885 276 7 in in IN 10_1101-2021_02_13_429885 276 8 Human Human NNP 10_1101-2021_02_13_429885 276 9 Cancer Cancer NNP 10_1101-2021_02_13_429885 276 10 Genomes Genomes NNPS 10_1101-2021_02_13_429885 276 11 . . . 
10_1101-2021_02_13_429885 276 12 ” " '' 10_1101-2021_02_13_429885 276 13 Nature Nature NNP 10_1101-2021_02_13_429885 276 14 578 578 CD 10_1101-2021_02_13_429885 276 15 ( ( -LRB- 10_1101-2021_02_13_429885 276 16 7793 7793 CD 10_1101-2021_02_13_429885 276 17 ) ) -RRB- 10_1101-2021_02_13_429885 276 18 : : : 10_1101-2021_02_13_429885 276 19 112–21 112–21 CD 10_1101-2021_02_13_429885 276 20 . . . 10_1101-2021_02_13_429885 277 1 https://doi.org/10.1038/s41586-019-1913-9 https://doi.org/10.1038/s41586-019-1913-9 NNP 10_1101-2021_02_13_429885 277 2 . . NNP 10_1101-2021_02_13_429885 278 1 Macintyre Macintyre NNP 10_1101-2021_02_13_429885 278 2 , , , 10_1101-2021_02_13_429885 278 3 Geoff Geoff NNP 10_1101-2021_02_13_429885 278 4 , , , 10_1101-2021_02_13_429885 278 5 Teodora Teodora NNP 10_1101-2021_02_13_429885 278 6 E. E. NNP 10_1101-2021_02_13_429885 278 7 Goranova Goranova NNP 10_1101-2021_02_13_429885 278 8 , , , 10_1101-2021_02_13_429885 278 9 Dilrini Dilrini NNP 10_1101-2021_02_13_429885 278 10 De De NNP 10_1101-2021_02_13_429885 278 11 Silva Silva NNP 10_1101-2021_02_13_429885 278 12 , , , 10_1101-2021_02_13_429885 278 13 Darren Darren NNP 10_1101-2021_02_13_429885 278 14 Ennis Ennis NNP 10_1101-2021_02_13_429885 278 15 , , , 10_1101-2021_02_13_429885 278 16 Anna Anna NNP 10_1101-2021_02_13_429885 278 17 M. M. NNP 10_1101-2021_02_13_429885 278 18 Piskorz Piskorz NNP 10_1101-2021_02_13_429885 278 19 , , , 10_1101-2021_02_13_429885 278 20 Matthew Matthew NNP 10_1101-2021_02_13_429885 278 21 Eldridge Eldridge NNP 10_1101-2021_02_13_429885 278 22 , , , 10_1101-2021_02_13_429885 278 23 Daoud Daoud NNP 10_1101-2021_02_13_429885 278 24 Sie Sie NNP 10_1101-2021_02_13_429885 278 25 , , , 10_1101-2021_02_13_429885 278 26 et et NNP 10_1101-2021_02_13_429885 278 27 al al NNP 10_1101-2021_02_13_429885 278 28 . . . 10_1101-2021_02_13_429885 279 1 2018 2018 CD 10_1101-2021_02_13_429885 279 2 . . . 
10_1101-2021_02_13_429885 280 1 “ " `` 10_1101-2021_02_13_429885 280 2 Copy copy VB 10_1101-2021_02_13_429885 280 3 Number number NN 10_1101-2021_02_13_429885 280 4 Signatures signature NNS 10_1101-2021_02_13_429885 280 5 and and CC 10_1101-2021_02_13_429885 280 6 Mutational mutational JJ 10_1101-2021_02_13_429885 280 7 Processes Processes NNPS 10_1101-2021_02_13_429885 280 8 in in IN 10_1101-2021_02_13_429885 280 9 Ovarian Ovarian NNP 10_1101-2021_02_13_429885 280 10 Carcinoma Carcinoma NNP 10_1101-2021_02_13_429885 280 11 . . . 10_1101-2021_02_13_429885 280 12 ” " '' 10_1101-2021_02_13_429885 280 13 Nature nature JJ 10_1101-2021_02_13_429885 280 14 Genetics genetics NN 10_1101-2021_02_13_429885 280 15 50 50 CD 10_1101-2021_02_13_429885 280 16 ( ( -LRB- 10_1101-2021_02_13_429885 280 17 9 9 CD 10_1101-2021_02_13_429885 280 18 ) ) -RRB- 10_1101-2021_02_13_429885 280 19 : : : 10_1101-2021_02_13_429885 280 20 1262–70 1262–70 CD 10_1101-2021_02_13_429885 280 21 . . . 10_1101-2021_02_13_429885 281 1 https://doi.org/10.1038/s41588-018-0179-8 https://doi.org/10.1038/s41588-018-0179-8 NNS 10_1101-2021_02_13_429885 281 2 . . CD 10_1101-2021_02_13_429885 282 1 Martincorena Martincorena NNP 10_1101-2021_02_13_429885 282 2 , , , 10_1101-2021_02_13_429885 282 3 Iñigo Iñigo NNP 10_1101-2021_02_13_429885 282 4 , , , 10_1101-2021_02_13_429885 282 5 Joanna Joanna NNP 10_1101-2021_02_13_429885 282 6 C. C. NNP 10_1101-2021_02_13_429885 282 7 Fowler Fowler NNP 10_1101-2021_02_13_429885 282 8 , , , 10_1101-2021_02_13_429885 282 9 Agnieszka Agnieszka NNP 10_1101-2021_02_13_429885 282 10 Wabik Wabik NNP 10_1101-2021_02_13_429885 282 11 , , , 10_1101-2021_02_13_429885 282 12 Andrew Andrew NNP 10_1101-2021_02_13_429885 282 13 R. R. NNP 10_1101-2021_02_13_429885 282 14 J. J. 
NNP 10_1101-2021_02_13_429885 282 15 Lawson Lawson NNP 10_1101-2021_02_13_429885 282 16 , , , 10_1101-2021_02_13_429885 282 17 Federico Federico NNP 10_1101-2021_02_13_429885 282 18 Abascal Abascal NNP 10_1101-2021_02_13_429885 282 19 , , , 10_1101-2021_02_13_429885 282 20 Michael Michael NNP 10_1101-2021_02_13_429885 282 21 W. W. NNP 10_1101-2021_02_13_429885 282 22 J. J. NNP 10_1101-2021_02_13_429885 283 1 Hall Hall NNP 10_1101-2021_02_13_429885 283 2 , , , 10_1101-2021_02_13_429885 283 3 Alex Alex NNP 10_1101-2021_02_13_429885 283 4 Cagan Cagan NNP 10_1101-2021_02_13_429885 283 5 , , , 10_1101-2021_02_13_429885 283 6 et et NNP 10_1101-2021_02_13_429885 283 7 al al NNP 10_1101-2021_02_13_429885 283 8 . . . 10_1101-2021_02_13_429885 284 1 2018 2018 CD 10_1101-2021_02_13_429885 284 2 . . . 10_1101-2021_02_13_429885 285 1 “ " `` 10_1101-2021_02_13_429885 285 2 Somatic Somatic NNP 10_1101-2021_02_13_429885 285 3 Mutant Mutant NNP 10_1101-2021_02_13_429885 285 4 Clones Clones NNPS 10_1101-2021_02_13_429885 285 5 Colonize colonize VBP 10_1101-2021_02_13_429885 285 6 the the DT 10_1101-2021_02_13_429885 285 7 Human Human NNP 10_1101-2021_02_13_429885 285 8 Esophagus Esophagus NNP 10_1101-2021_02_13_429885 285 9 with with IN 10_1101-2021_02_13_429885 285 10 Age Age NNP 10_1101-2021_02_13_429885 285 11 . . . 10_1101-2021_02_13_429885 285 12 ” " '' 10_1101-2021_02_13_429885 285 13 Science science CD 10_1101-2021_02_13_429885 285 14 362 362 CD 10_1101-2021_02_13_429885 285 15 ( ( -LRB- 10_1101-2021_02_13_429885 285 16 6417 6417 CD 10_1101-2021_02_13_429885 285 17 ) ) -RRB- 10_1101-2021_02_13_429885 285 18 : : : 10_1101-2021_02_13_429885 285 19 911–17 911–17 CD 10_1101-2021_02_13_429885 285 20 . . . 10_1101-2021_02_13_429885 286 1 https://doi.org/10.1126/science.aau3879 https://doi.org/10.1126/science.aau3879 NNP 10_1101-2021_02_13_429885 286 2 . . 
NNP 10_1101-2021_02_13_429885 287 1 Martincorena Martincorena NNP 10_1101-2021_02_13_429885 287 2 , , , 10_1101-2021_02_13_429885 287 3 Iñigo Iñigo NNP 10_1101-2021_02_13_429885 287 4 , , , 10_1101-2021_02_13_429885 287 5 Amit Amit NNP 10_1101-2021_02_13_429885 287 6 Roshan Roshan NNP 10_1101-2021_02_13_429885 287 7 , , , 10_1101-2021_02_13_429885 287 8 Moritz Moritz NNP 10_1101-2021_02_13_429885 287 9 Gerstung Gerstung NNP 10_1101-2021_02_13_429885 287 10 , , , 10_1101-2021_02_13_429885 287 11 Peter Peter NNP 10_1101-2021_02_13_429885 287 12 Ellis Ellis NNP 10_1101-2021_02_13_429885 287 13 , , , 10_1101-2021_02_13_429885 287 14 Peter Peter NNP 10_1101-2021_02_13_429885 287 15 Van Van NNP 10_1101-2021_02_13_429885 287 16 Loo Loo NNP 10_1101-2021_02_13_429885 287 17 , , , 10_1101-2021_02_13_429885 287 18 Stuart Stuart NNP 10_1101-2021_02_13_429885 287 19 McLaren McLaren NNP 10_1101-2021_02_13_429885 287 20 , , , 10_1101-2021_02_13_429885 287 21 David David NNP 10_1101-2021_02_13_429885 287 22 C. C. NNP 10_1101-2021_02_13_429885 287 23 Wedge Wedge NNP 10_1101-2021_02_13_429885 287 24 , , , 10_1101-2021_02_13_429885 287 25 et et NNP 10_1101-2021_02_13_429885 287 26 al al NNP 10_1101-2021_02_13_429885 287 27 . . . 10_1101-2021_02_13_429885 288 1 2015 2015 CD 10_1101-2021_02_13_429885 288 2 . . . 
10_1101-2021_02_13_429885 289 1 “ " `` 10_1101-2021_02_13_429885 289 2 High High NNP 10_1101-2021_02_13_429885 289 3 Burden Burden NNP 10_1101-2021_02_13_429885 289 4 and and CC 10_1101-2021_02_13_429885 289 5 Pervasive Pervasive NNP 10_1101-2021_02_13_429885 289 6 Positive Positive NNP 10_1101-2021_02_13_429885 289 7 Selection Selection NNP 10_1101-2021_02_13_429885 289 8 of of IN 10_1101-2021_02_13_429885 289 9 Somatic Somatic NNP 10_1101-2021_02_13_429885 289 10 Mutations Mutations NNPS 10_1101-2021_02_13_429885 289 11 in in IN 10_1101-2021_02_13_429885 289 12 Normal Normal NNP 10_1101-2021_02_13_429885 289 13 Human Human NNP 10_1101-2021_02_13_429885 289 14 Skin Skin NNP 10_1101-2021_02_13_429885 289 15 . . . 10_1101-2021_02_13_429885 289 16 ” " '' 10_1101-2021_02_13_429885 289 17 ​Science​ ​science​ CD 10_1101-2021_02_13_429885 289 18 348 348 CD 10_1101-2021_02_13_429885 289 19 ( ( -LRB- 10_1101-2021_02_13_429885 289 20 6237 6237 CD 10_1101-2021_02_13_429885 289 21 ) ) -RRB- 10_1101-2021_02_13_429885 289 22 : : : 10_1101-2021_02_13_429885 289 23 880–86 880–86 CD 10_1101-2021_02_13_429885 289 24 . . . 10_1101-2021_02_13_429885 290 1 https://doi.org/​10.1126/science.aaa6806 https://doi.org/​10.1126/science.aaa6806 VBD 10_1101-2021_02_13_429885 290 2 ​. ​. NNP 10_1101-2021_02_13_429885 291 1 McGranahan McGranahan NNP 10_1101-2021_02_13_429885 291 2 , , , 10_1101-2021_02_13_429885 291 3 Nicholas Nicholas NNP 10_1101-2021_02_13_429885 291 4 , , , 10_1101-2021_02_13_429885 291 5 and and CC 10_1101-2021_02_13_429885 291 6 Charles Charles NNP 10_1101-2021_02_13_429885 291 7 Swanton Swanton NNP 10_1101-2021_02_13_429885 291 8 . . . 10_1101-2021_02_13_429885 292 1 2015 2015 CD 10_1101-2021_02_13_429885 292 2 . . . 
10_1101-2021_02_13_429885 293 1 “ " `` 10_1101-2021_02_13_429885 293 2 Biological biological JJ 10_1101-2021_02_13_429885 293 3 and and CC 10_1101-2021_02_13_429885 293 4 Therapeutic therapeutic JJ 10_1101-2021_02_13_429885 293 5 Impact impact NN 10_1101-2021_02_13_429885 293 6 of of IN 10_1101-2021_02_13_429885 293 7 Intratumor intratumor JJ 10_1101-2021_02_13_429885 293 8 Heterogeneity Heterogeneity NNP 10_1101-2021_02_13_429885 293 9 in in IN 10_1101-2021_02_13_429885 293 10 Cancer Cancer NNP 10_1101-2021_02_13_429885 293 11 Evolution Evolution NNP 10_1101-2021_02_13_429885 293 12 . . . 10_1101-2021_02_13_429885 293 13 ” " '' 10_1101-2021_02_13_429885 293 14 ​Cancer ​cancer NN 10_1101-2021_02_13_429885 293 15 Cell​ cell​ CD 10_1101-2021_02_13_429885 293 16 27 27 CD 10_1101-2021_02_13_429885 293 17 ( ( -LRB- 10_1101-2021_02_13_429885 293 18 1 1 CD 10_1101-2021_02_13_429885 293 19 ) ) -RRB- 10_1101-2021_02_13_429885 293 20 : : : 10_1101-2021_02_13_429885 293 21 15–26 15–26 CD 10_1101-2021_02_13_429885 293 22 . . . 10_1101-2021_02_13_429885 294 1 https://doi.org/​10.1016/j.ccell.2014.12.001 https://doi.org/​10.1016/j.ccell.2014.12.001 NNP 10_1101-2021_02_13_429885 294 2 ​. ​. NNP 10_1101-2021_02_13_429885 295 1 — — : 10_1101-2021_02_13_429885 295 2 — — : 10_1101-2021_02_13_429885 295 3 — — : 10_1101-2021_02_13_429885 295 4 . . . 10_1101-2021_02_13_429885 296 1 2017 2017 CD 10_1101-2021_02_13_429885 296 2 . . . 
10_1101-2021_02_13_429885 297 1 “ " `` 10_1101-2021_02_13_429885 297 2 Clonal clonal JJ 10_1101-2021_02_13_429885 297 3 Heterogeneity Heterogeneity NNP 10_1101-2021_02_13_429885 297 4 and and CC 10_1101-2021_02_13_429885 297 5 Tumor Tumor NNP 10_1101-2021_02_13_429885 297 6 Evolution Evolution NNP 10_1101-2021_02_13_429885 297 7 : : : 10_1101-2021_02_13_429885 297 8 Past past JJ 10_1101-2021_02_13_429885 297 9 , , , 10_1101-2021_02_13_429885 297 10 Present present NN 10_1101-2021_02_13_429885 297 11 , , , 10_1101-2021_02_13_429885 297 12 and and CC 10_1101-2021_02_13_429885 297 13 the the DT 10_1101-2021_02_13_429885 297 14 Future Future NNP 10_1101-2021_02_13_429885 297 15 . . . 10_1101-2021_02_13_429885 297 16 ” " '' 10_1101-2021_02_13_429885 297 17 ​Cell ​Cell VBG 10_1101-2021_02_13_429885 297 18 168 168 CD 10_1101-2021_02_13_429885 297 19 ( ( -LRB- 10_1101-2021_02_13_429885 297 20 4 4 CD 10_1101-2021_02_13_429885 297 21 ) ) -RRB- 10_1101-2021_02_13_429885 297 22 : : : 10_1101-2021_02_13_429885 297 23 613–28 613–28 CD 10_1101-2021_02_13_429885 297 24 . . . 10_1101-2021_02_13_429885 298 1 https://doi.org/​10.1016/j.cell.2017.01.018 https://doi.org/​10.1016/j.cell.2017.01.018 NNP 10_1101-2021_02_13_429885 298 2 ​. ​. CD 10_1101-2021_02_13_429885 299 1 Nik Nik NNP 10_1101-2021_02_13_429885 299 2 - - HYPH 10_1101-2021_02_13_429885 299 3 Zainal Zainal NNP 10_1101-2021_02_13_429885 299 4 , , , 10_1101-2021_02_13_429885 299 5 Serena Serena NNP 10_1101-2021_02_13_429885 299 6 , , , 10_1101-2021_02_13_429885 299 7 Peter Peter NNP 10_1101-2021_02_13_429885 299 8 Van Van NNP 10_1101-2021_02_13_429885 299 9 Loo Loo NNP 10_1101-2021_02_13_429885 299 10 , , , 10_1101-2021_02_13_429885 299 11 David David NNP 10_1101-2021_02_13_429885 299 12 C. C. NNP 10_1101-2021_02_13_429885 299 13 Wedge Wedge NNP 10_1101-2021_02_13_429885 299 14 , , , 10_1101-2021_02_13_429885 299 15 Ludmil Ludmil NNP 10_1101-2021_02_13_429885 299 16 B. B. 
NNP 10_1101-2021_02_13_429885 299 17 Alexandrov Alexandrov NNP 10_1101-2021_02_13_429885 299 18 , , , 10_1101-2021_02_13_429885 299 19 Christopher Christopher NNP 10_1101-2021_02_13_429885 299 20 D. D. NNP 10_1101-2021_02_13_429885 299 21 Greenman Greenman NNP 10_1101-2021_02_13_429885 299 22 , , , 10_1101-2021_02_13_429885 299 23 King King NNP 10_1101-2021_02_13_429885 299 24 Wai Wai NNP 10_1101-2021_02_13_429885 299 25 Lau Lau NNP 10_1101-2021_02_13_429885 299 26 , , , 10_1101-2021_02_13_429885 299 27 Keiran Keiran NNP 10_1101-2021_02_13_429885 299 28 Raine Raine NNP 10_1101-2021_02_13_429885 299 29 , , , 10_1101-2021_02_13_429885 299 30 et et NNP 10_1101-2021_02_13_429885 299 31 al al NNP 10_1101-2021_02_13_429885 299 32 . . . 10_1101-2021_02_13_429885 300 1 2012 2012 CD 10_1101-2021_02_13_429885 300 2 . . . 10_1101-2021_02_13_429885 301 1 “ " `` 10_1101-2021_02_13_429885 301 2 The the DT 10_1101-2021_02_13_429885 301 3 Life Life NNP 10_1101-2021_02_13_429885 301 4 History history NN 10_1101-2021_02_13_429885 301 5 of of IN 10_1101-2021_02_13_429885 301 6 21 21 CD 10_1101-2021_02_13_429885 301 7 Breast Breast NNP 10_1101-2021_02_13_429885 301 8 Cancers Cancers NNPS 10_1101-2021_02_13_429885 301 9 . . . 10_1101-2021_02_13_429885 301 10 ” " '' 10_1101-2021_02_13_429885 301 11 ​Cell​ ​Cell​ NNP 10_1101-2021_02_13_429885 301 12 149 149 CD 10_1101-2021_02_13_429885 301 13 ( ( -LRB- 10_1101-2021_02_13_429885 301 14 5 5 CD 10_1101-2021_02_13_429885 301 15 ) ) -RRB- 10_1101-2021_02_13_429885 301 16 : : : 10_1101-2021_02_13_429885 301 17 994–1007 994–1007 CD 10_1101-2021_02_13_429885 301 18 . . . 10_1101-2021_02_13_429885 302 1 https://doi.org/​10.1016/j.cell.2012.04.023 https://doi.org/​10.1016/j.cell.2012.04.023 NNP 10_1101-2021_02_13_429885 302 2 ​. ​. 
NNP 10_1101-2021_02_13_429885 303 1 .CC .CC NFP 10_1101-2021_02_13_429885 303 2 - - : 10_1101-2021_02_13_429885 303 3 BY by IN 10_1101-2021_02_13_429885 303 4 - - HYPH 10_1101-2021_02_13_429885 303 5 NC NC NNP 10_1101-2021_02_13_429885 303 6 - - HYPH 10_1101-2021_02_13_429885 303 7 ND ND NNP 10_1101-2021_02_13_429885 303 8 4.0 4.0 CD 10_1101-2021_02_13_429885 303 9 International International NNP 10_1101-2021_02_13_429885 303 10 licenseavailable licenseavailable NN 10_1101-2021_02_13_429885 303 11 under under IN 10_1101-2021_02_13_429885 303 12 a a DT 10_1101-2021_02_13_429885 303 13 ( ( -LRB- 10_1101-2021_02_13_429885 303 14 which which WDT 10_1101-2021_02_13_429885 303 15 was be VBD 10_1101-2021_02_13_429885 303 16 not not RB 10_1101-2021_02_13_429885 303 17 certified certify VBN 10_1101-2021_02_13_429885 303 18 by by IN 10_1101-2021_02_13_429885 303 19 peer peer NN 10_1101-2021_02_13_429885 303 20 review review NN 10_1101-2021_02_13_429885 303 21 ) ) -RRB- 10_1101-2021_02_13_429885 303 22 is be VBZ 10_1101-2021_02_13_429885 303 23 the the DT 10_1101-2021_02_13_429885 303 24 author author NN 10_1101-2021_02_13_429885 303 25 / / SYM 10_1101-2021_02_13_429885 303 26 funder funder NN 10_1101-2021_02_13_429885 303 27 , , , 10_1101-2021_02_13_429885 303 28 who who WP 10_1101-2021_02_13_429885 303 29 has have VBZ 10_1101-2021_02_13_429885 303 30 granted grant VBN 10_1101-2021_02_13_429885 303 31 bioRxiv biorxiv IN 10_1101-2021_02_13_429885 303 32 a a DT 10_1101-2021_02_13_429885 303 33 license license NN 10_1101-2021_02_13_429885 303 34 to to TO 10_1101-2021_02_13_429885 303 35 display display VB 10_1101-2021_02_13_429885 303 36 the the DT 10_1101-2021_02_13_429885 303 37 preprint preprint NN 10_1101-2021_02_13_429885 303 38 in in IN 10_1101-2021_02_13_429885 303 39 perpetuity perpetuity NN 10_1101-2021_02_13_429885 303 40 . . . 
10_1101-2021_02_13_429885 304 1 It -PRON- PRP 10_1101-2021_02_13_429885 304 2 is be VBZ 10_1101-2021_02_13_429885 304 3 made make VBN 10_1101-2021_02_13_429885 304 4 The the DT 10_1101-2021_02_13_429885 304 5 copyright copyright NN 10_1101-2021_02_13_429885 304 6 holder holder NN 10_1101-2021_02_13_429885 304 7 for for IN 10_1101-2021_02_13_429885 304 8 this this DT 10_1101-2021_02_13_429885 304 9 preprintthis preprintthis NN 10_1101-2021_02_13_429885 304 10 version version NN 10_1101-2021_02_13_429885 304 11 posted post VBD 10_1101-2021_02_13_429885 304 12 February February NNP 10_1101-2021_02_13_429885 304 13 13 13 CD 10_1101-2021_02_13_429885 304 14 , , , 10_1101-2021_02_13_429885 304 15 2021 2021 CD 10_1101-2021_02_13_429885 304 16 . . . 10_1101-2021_02_13_429885 304 17 ; ; : 10_1101-2021_02_13_429885 304 18 https://doi.org/10.1101/2021.02.13.429885doi https://doi.org/10.1101/2021.02.13.429885doi ADD 10_1101-2021_02_13_429885 304 19 : : : 10_1101-2021_02_13_429885 304 20 bioRxiv biorxiv VB 10_1101-2021_02_13_429885 304 21 preprint preprint NN 10_1101-2021_02_13_429885 304 135 https://doi.org/10.1101/2021.02.13.429885 https://doi.org/10.1101/2021.02.13.429885 NNP 10_1101-2021_02_13_429885 304 136 http://creativecommons.org/licenses/by-nc-nd/4.0/ http://creativecommons.org/licenses/by-nc-nd/4.0/ CD 10_1101-2021_02_13_429885 304 137 Househam Househam NNP 10_1101-2021_02_13_429885 304 138 et et FW 10_1101-2021_02_13_429885 304 139 al al NNP 10_1101-2021_02_13_429885 304 140 . . . 
10_1101-2021_02_13_429885 306 1 Priestley Priestley NNP 10_1101-2021_02_13_429885 306 2 , , , 10_1101-2021_02_13_429885 306 3 Peter Peter NNP 10_1101-2021_02_13_429885 306 4 , , , 10_1101-2021_02_13_429885 306 5 Jonathan Jonathan NNP 10_1101-2021_02_13_429885 306 6 Baber Baber NNP 10_1101-2021_02_13_429885 306 7 , , , 10_1101-2021_02_13_429885 306 8 Martijn Martijn NNP 10_1101-2021_02_13_429885 306 9 P. P. 
NNP 10_1101-2021_02_13_429885 306 10 Lolkema Lolkema NNP 10_1101-2021_02_13_429885 306 11 , , , 10_1101-2021_02_13_429885 306 12 Neeltje Neeltje NNP 10_1101-2021_02_13_429885 306 13 Steeghs Steeghs NNP 10_1101-2021_02_13_429885 306 14 , , , 10_1101-2021_02_13_429885 306 15 Ewart Ewart NNP 10_1101-2021_02_13_429885 306 16 de de FW 10_1101-2021_02_13_429885 306 17 Bruijn Bruijn NNP 10_1101-2021_02_13_429885 306 18 , , , 10_1101-2021_02_13_429885 306 19 Charles Charles NNP 10_1101-2021_02_13_429885 306 20 Shale Shale NNP 10_1101-2021_02_13_429885 306 21 , , , 10_1101-2021_02_13_429885 306 22 Korneel Korneel NNP 10_1101-2021_02_13_429885 306 23 Duyvesteyn Duyvesteyn NNP 10_1101-2021_02_13_429885 306 24 , , , 10_1101-2021_02_13_429885 306 25 et et NNP 10_1101-2021_02_13_429885 306 26 al al NNP 10_1101-2021_02_13_429885 306 27 . . . 10_1101-2021_02_13_429885 307 1 2019 2019 CD 10_1101-2021_02_13_429885 307 2 . . . 10_1101-2021_02_13_429885 308 1 “ " `` 10_1101-2021_02_13_429885 308 2 Pan Pan NNP 10_1101-2021_02_13_429885 308 3 - - NNP 10_1101-2021_02_13_429885 308 4 Cancer cancer JJ 10_1101-2021_02_13_429885 308 5 Whole whole JJ 10_1101-2021_02_13_429885 308 6 - - HYPH 10_1101-2021_02_13_429885 308 7 Genome genome JJ 10_1101-2021_02_13_429885 308 8 Analyses analysis NNS 10_1101-2021_02_13_429885 308 9 of of IN 10_1101-2021_02_13_429885 308 10 Metastatic metastatic JJ 10_1101-2021_02_13_429885 308 11 Solid Solid NNP 10_1101-2021_02_13_429885 308 12 Tumours Tumours NNPS 10_1101-2021_02_13_429885 308 13 . . . 10_1101-2021_02_13_429885 308 14 ” " '' 10_1101-2021_02_13_429885 308 15 ​Nature​ ​Nature​ NNP 10_1101-2021_02_13_429885 308 16 575 575 CD 10_1101-2021_02_13_429885 308 17 ( ( -LRB- 10_1101-2021_02_13_429885 308 18 7781 7781 CD 10_1101-2021_02_13_429885 308 19 ) ) -RRB- 10_1101-2021_02_13_429885 308 20 : : : 10_1101-2021_02_13_429885 308 21 210–16 210–16 CD 10_1101-2021_02_13_429885 308 22 . . . 
10_1101-2021_02_13_429885 309 1 https://doi.org/​10.1038/s41586-019-1689-y​. https://doi.org/​10.1038/s41586-019-1689-y​. LS 10_1101-2021_02_13_429885 310 1 Turajlic Turajlic NNP 10_1101-2021_02_13_429885 310 2 , , , 10_1101-2021_02_13_429885 310 3 Samra Samra NNP 10_1101-2021_02_13_429885 310 4 , , , 10_1101-2021_02_13_429885 310 5 Hang Hang NNP 10_1101-2021_02_13_429885 310 6 Xu Xu NNP 10_1101-2021_02_13_429885 310 7 , , , 10_1101-2021_02_13_429885 310 8 Kevin Kevin NNP 10_1101-2021_02_13_429885 310 9 Litchfield Litchfield NNP 10_1101-2021_02_13_429885 310 10 , , , 10_1101-2021_02_13_429885 310 11 Andrew Andrew NNP 10_1101-2021_02_13_429885 310 12 Rowan Rowan NNP 10_1101-2021_02_13_429885 310 13 , , , 10_1101-2021_02_13_429885 310 14 Stuart Stuart NNP 10_1101-2021_02_13_429885 310 15 Horswell Horswell NNP 10_1101-2021_02_13_429885 310 16 , , , 10_1101-2021_02_13_429885 310 17 Tim Tim NNP 10_1101-2021_02_13_429885 310 18 Chambers Chambers NNP 10_1101-2021_02_13_429885 310 19 , , , 10_1101-2021_02_13_429885 310 20 Tim Tim NNP 10_1101-2021_02_13_429885 310 21 O’Brien O’Brien NNP 10_1101-2021_02_13_429885 310 22 , , , 10_1101-2021_02_13_429885 310 23 et et NNP 10_1101-2021_02_13_429885 310 24 al al NNP 10_1101-2021_02_13_429885 310 25 . . . 10_1101-2021_02_13_429885 311 1 2018 2018 CD 10_1101-2021_02_13_429885 311 2 . . . 10_1101-2021_02_13_429885 312 1 “ " `` 10_1101-2021_02_13_429885 312 2 Deterministic Deterministic NNP 10_1101-2021_02_13_429885 312 3 Evolutionary Evolutionary NNP 10_1101-2021_02_13_429885 312 4 Trajectories Trajectories NNPS 10_1101-2021_02_13_429885 312 5 Influence Influence NNP 10_1101-2021_02_13_429885 312 6 Primary Primary NNP 10_1101-2021_02_13_429885 312 7 Tumor Tumor NNP 10_1101-2021_02_13_429885 312 8 Growth growth NN 10_1101-2021_02_13_429885 312 9 : : : 10_1101-2021_02_13_429885 312 10 TRACERx TRACERx NNPS 10_1101-2021_02_13_429885 312 11 Renal renal JJ 10_1101-2021_02_13_429885 312 12 . . . 
10_1101-2021_02_13_429885 312 13 ” " '' 10_1101-2021_02_13_429885 312 14 ​Cell​ ​Cell​ NNP 10_1101-2021_02_13_429885 312 15 173 173 CD 10_1101-2021_02_13_429885 312 16 ( ( -LRB- 10_1101-2021_02_13_429885 312 17 3 3 CD 10_1101-2021_02_13_429885 312 18 ) ) -RRB- 10_1101-2021_02_13_429885 312 19 : : : 10_1101-2021_02_13_429885 312 20 595–610.e11 595–610.e11 LS 10_1101-2021_02_13_429885 312 21 . . . 10_1101-2021_02_13_429885 313 1 https://doi.org/​10.1016/j.cell.2018.03.043 https://doi.org/​10.1016/j.cell.2018.03.043 NNS 10_1101-2021_02_13_429885 313 2 ​. ​. JJ 10_1101-2021_02_13_429885 314 1 Turnbull Turnbull NNP 10_1101-2021_02_13_429885 314 2 , , , 10_1101-2021_02_13_429885 314 3 Clare Clare NNP 10_1101-2021_02_13_429885 314 4 , , , 10_1101-2021_02_13_429885 314 5 Richard Richard NNP 10_1101-2021_02_13_429885 314 6 H. H. NNP 10_1101-2021_02_13_429885 314 7 Scott Scott NNP 10_1101-2021_02_13_429885 314 8 , , , 10_1101-2021_02_13_429885 314 9 Ellen Ellen NNP 10_1101-2021_02_13_429885 314 10 Thomas Thomas NNP 10_1101-2021_02_13_429885 314 11 , , , 10_1101-2021_02_13_429885 314 12 Louise Louise NNP 10_1101-2021_02_13_429885 314 13 Jones Jones NNP 10_1101-2021_02_13_429885 314 14 , , , 10_1101-2021_02_13_429885 314 15 Nirupa Nirupa NNP 10_1101-2021_02_13_429885 314 16 Murugaesu Murugaesu NNP 10_1101-2021_02_13_429885 314 17 , , , 10_1101-2021_02_13_429885 314 18 Freya Freya NNP 10_1101-2021_02_13_429885 314 19 Boardman Boardman NNP 10_1101-2021_02_13_429885 314 20 Pretty Pretty NNP 10_1101-2021_02_13_429885 314 21 , , , 10_1101-2021_02_13_429885 314 22 Dina Dina NNP 10_1101-2021_02_13_429885 314 23 Halai Halai NNP 10_1101-2021_02_13_429885 314 24 , , , 10_1101-2021_02_13_429885 314 25 et et NNP 10_1101-2021_02_13_429885 314 26 al al NNP 10_1101-2021_02_13_429885 314 27 . . . 10_1101-2021_02_13_429885 315 1 2018 2018 CD 10_1101-2021_02_13_429885 315 2 . . . 
10_1101-2021_02_13_429885 316 1 “ " `` 10_1101-2021_02_13_429885 316 2 The the DT 10_1101-2021_02_13_429885 316 3 100 100 CD 10_1101-2021_02_13_429885 316 4 000 000 CD 10_1101-2021_02_13_429885 316 5 Genomes Genomes NNP 10_1101-2021_02_13_429885 316 6 Project Project NNP 10_1101-2021_02_13_429885 316 7 : : : 10_1101-2021_02_13_429885 316 8 Bringing bring VBG 10_1101-2021_02_13_429885 316 9 Whole whole JJ 10_1101-2021_02_13_429885 316 10 Genome Genome NNP 10_1101-2021_02_13_429885 316 11 Sequencing sequencing NN 10_1101-2021_02_13_429885 316 12 to to IN 10_1101-2021_02_13_429885 316 13 the the DT 10_1101-2021_02_13_429885 316 14 NHS NHS NNP 10_1101-2021_02_13_429885 316 15 . . . 10_1101-2021_02_13_429885 316 16 ” " '' 10_1101-2021_02_13_429885 316 17 ​BMJ ​BMJ NNP 10_1101-2021_02_13_429885 316 18 ​ ​ NNP 10_1101-2021_02_13_429885 316 19 361 361 CD 10_1101-2021_02_13_429885 316 20 ( ( -LRB- 10_1101-2021_02_13_429885 316 21 April April NNP 10_1101-2021_02_13_429885 316 22 ) ) -RRB- 10_1101-2021_02_13_429885 316 23 : : : 10_1101-2021_02_13_429885 316 24 k1687 k1687 NNP 10_1101-2021_02_13_429885 316 25 . . . 10_1101-2021_02_13_429885 317 1 https://doi.org/​10.1136/bmj.k1687 https://doi.org/​10.1136/bmj.k1687 NNP 10_1101-2021_02_13_429885 317 2 ​. ​. NNP 10_1101-2021_02_13_429885 318 1 Van Van NNP 10_1101-2021_02_13_429885 318 2 Loo Loo NNP 10_1101-2021_02_13_429885 318 3 , , , 10_1101-2021_02_13_429885 318 4 Peter Peter NNP 10_1101-2021_02_13_429885 318 5 , , , 10_1101-2021_02_13_429885 318 6 Silje Silje NNP 10_1101-2021_02_13_429885 318 7 H. H. NNP 10_1101-2021_02_13_429885 318 8 Nordgard Nordgard NNP 10_1101-2021_02_13_429885 318 9 , , , 10_1101-2021_02_13_429885 318 10 Ole Ole NNP 10_1101-2021_02_13_429885 318 11 Christian Christian NNP 10_1101-2021_02_13_429885 318 12 Lingjærde Lingjærde NNP 10_1101-2021_02_13_429885 318 13 , , , 10_1101-2021_02_13_429885 318 14 Hege Hege NNP 10_1101-2021_02_13_429885 318 15 G. G. 
NNP 10_1101-2021_02_13_429885 318 16 Russnes Russnes NNP 10_1101-2021_02_13_429885 318 17 , , , 10_1101-2021_02_13_429885 318 18 Inga Inga NNP 10_1101-2021_02_13_429885 318 19 H. H. NNP 10_1101-2021_02_13_429885 318 20 Rye Rye NNP 10_1101-2021_02_13_429885 318 21 , , , 10_1101-2021_02_13_429885 318 22 Wei Wei NNP 10_1101-2021_02_13_429885 318 23 Sun Sun NNP 10_1101-2021_02_13_429885 318 24 , , , 10_1101-2021_02_13_429885 318 25 Victor Victor NNP 10_1101-2021_02_13_429885 318 26 J. J. NNP 10_1101-2021_02_13_429885 318 27 Weigman Weigman NNP 10_1101-2021_02_13_429885 318 28 , , , 10_1101-2021_02_13_429885 318 29 et et NNP 10_1101-2021_02_13_429885 318 30 al al NNP 10_1101-2021_02_13_429885 318 31 . . . 10_1101-2021_02_13_429885 319 1 2010 2010 CD 10_1101-2021_02_13_429885 319 2 . . . 10_1101-2021_02_13_429885 320 1 “ " `` 10_1101-2021_02_13_429885 320 2 Allele Allele NNP 10_1101-2021_02_13_429885 320 3 - - HYPH 10_1101-2021_02_13_429885 320 4 Specific Specific NNP 10_1101-2021_02_13_429885 320 5 Copy Copy NNP 10_1101-2021_02_13_429885 320 6 Number Number NNP 10_1101-2021_02_13_429885 320 7 Analysis Analysis NNP 10_1101-2021_02_13_429885 320 8 of of IN 10_1101-2021_02_13_429885 320 9 Tumors Tumors NNPS 10_1101-2021_02_13_429885 320 10 . . . 
10_1101-2021_02_13_429885 320 11 ” " '' 10_1101-2021_02_13_429885 320 12 Proceedings proceeding NNS 10_1101-2021_02_13_429885 320 13 of of IN 10_1101-2021_02_13_429885 320 14 the the DT 10_1101-2021_02_13_429885 320 15 National National NNP 10_1101-2021_02_13_429885 320 16 Academy Academy NNP 10_1101-2021_02_13_429885 320 17 of of IN 10_1101-2021_02_13_429885 320 18 Sciences Sciences NNPS 10_1101-2021_02_13_429885 320 19 of of IN 10_1101-2021_02_13_429885 320 20 the the DT 10_1101-2021_02_13_429885 320 21 United United NNP 10_1101-2021_02_13_429885 320 22 States States NNP 10_1101-2021_02_13_429885 320 23 of of IN 10_1101-2021_02_13_429885 320 24 America​ America​ NNP 10_1101-2021_02_13_429885 320 25 107 107 CD 10_1101-2021_02_13_429885 320 26 ( ( -LRB- 10_1101-2021_02_13_429885 320 27 39 39 CD 10_1101-2021_02_13_429885 320 28 ) ) -RRB- 10_1101-2021_02_13_429885 320 29 : : : 10_1101-2021_02_13_429885 320 30 16910–15 16910–15 LS 10_1101-2021_02_13_429885 320 31 . . . 10_1101-2021_02_13_429885 321 1 https://doi.org/​10.1073/pnas.1009843107 https://doi.org/​10.1073/pnas.1009843107 VB 10_1101-2021_02_13_429885 321 2 ​. ​. JJ 10_1101-2021_02_13_429885 322 1 Watkins Watkins NNP 10_1101-2021_02_13_429885 322 2 , , , 10_1101-2021_02_13_429885 322 3 Thomas Thomas NNP 10_1101-2021_02_13_429885 322 4 B. B. NNP 10_1101-2021_02_13_429885 322 5 K. K. NNP 10_1101-2021_02_13_429885 322 6 , , , 10_1101-2021_02_13_429885 322 7 Emilia Emilia NNP 10_1101-2021_02_13_429885 322 8 L. L. NNP 10_1101-2021_02_13_429885 322 9 Lim Lim NNP 10_1101-2021_02_13_429885 322 10 , , , 10_1101-2021_02_13_429885 322 11 Marina Marina NNP 10_1101-2021_02_13_429885 322 12 Petkovic Petkovic NNP 10_1101-2021_02_13_429885 322 13 , , , 10_1101-2021_02_13_429885 322 14 Sergi Sergi NNP 10_1101-2021_02_13_429885 322 15 Elizalde Elizalde NNP 10_1101-2021_02_13_429885 322 16 , , , 10_1101-2021_02_13_429885 322 17 Nicolai Nicolai NNP 10_1101-2021_02_13_429885 322 18 J. J. 
NNP 10_1101-2021_02_13_429885 322 19 Birkbak Birkbak NNP 10_1101-2021_02_13_429885 322 20 , , , 10_1101-2021_02_13_429885 322 21 Gareth Gareth NNP 10_1101-2021_02_13_429885 322 22 A. A. NNP 10_1101-2021_02_13_429885 322 23 Wilson Wilson NNP 10_1101-2021_02_13_429885 322 24 , , , 10_1101-2021_02_13_429885 322 25 David David NNP 10_1101-2021_02_13_429885 322 26 A. a. NN 10_1101-2021_02_13_429885 322 27 Moore Moore NNP 10_1101-2021_02_13_429885 322 28 , , , 10_1101-2021_02_13_429885 322 29 et et NNP 10_1101-2021_02_13_429885 322 30 al al NNP 10_1101-2021_02_13_429885 322 31 . . . 10_1101-2021_02_13_429885 323 1 11 11 CD 10_1101-2021_02_13_429885 323 2 2020 2020 CD 10_1101-2021_02_13_429885 323 3 . . . 10_1101-2021_02_13_429885 324 1 “ " `` 10_1101-2021_02_13_429885 324 2 Pervasive Pervasive NNP 10_1101-2021_02_13_429885 324 3 Chromosomal Chromosomal NNP 10_1101-2021_02_13_429885 324 4 Instability Instability NNP 10_1101-2021_02_13_429885 324 5 and and CC 10_1101-2021_02_13_429885 324 6 Karyotype Karyotype NNP 10_1101-2021_02_13_429885 324 7 Order Order NNP 10_1101-2021_02_13_429885 324 8 in in IN 10_1101-2021_02_13_429885 324 9 Tumour Tumour NNP 10_1101-2021_02_13_429885 324 10 Evolution Evolution NNP 10_1101-2021_02_13_429885 324 11 . . . 10_1101-2021_02_13_429885 324 12 ” " '' 10_1101-2021_02_13_429885 324 13 ​Nature​ ​Nature​ NNP 10_1101-2021_02_13_429885 324 14 587 587 CD 10_1101-2021_02_13_429885 324 15 ( ( -LRB- 10_1101-2021_02_13_429885 324 16 7832 7832 CD 10_1101-2021_02_13_429885 324 17 ) ) -RRB- 10_1101-2021_02_13_429885 324 18 : : : 10_1101-2021_02_13_429885 324 19 126–32 126–32 CD 10_1101-2021_02_13_429885 324 20 . . . 10_1101-2021_02_13_429885 325 1 https://doi.org/​10.1038/s41586-020-2698-6 https://doi.org/​10.1038/s41586-020-2698-6 ADD 10_1101-2021_02_13_429885 325 2 ​. ​. 
CD 10_1101-2021_02_13_429885 326 1 Zaccaria Zaccaria NNP 10_1101-2021_02_13_429885 326 2 , , , 10_1101-2021_02_13_429885 326 3 Simone Simone NNP 10_1101-2021_02_13_429885 326 4 , , , 10_1101-2021_02_13_429885 326 5 and and CC 10_1101-2021_02_13_429885 326 6 Benjamin Benjamin NNP 10_1101-2021_02_13_429885 326 7 J. J. NNP 10_1101-2021_02_13_429885 326 8 Raphael Raphael NNP 10_1101-2021_02_13_429885 326 9 . . . 10_1101-2021_02_13_429885 327 1 2020 2020 CD 10_1101-2021_02_13_429885 327 2 . . . 10_1101-2021_02_13_429885 328 1 “ " `` 10_1101-2021_02_13_429885 328 2 Accurate accurate JJ 10_1101-2021_02_13_429885 328 3 Quantification quantification NN 10_1101-2021_02_13_429885 328 4 of of IN 10_1101-2021_02_13_429885 328 5 Copy Copy NNP 10_1101-2021_02_13_429885 328 6 - - HYPH 10_1101-2021_02_13_429885 328 7 Number Number NNP 10_1101-2021_02_13_429885 328 8 Aberrations Aberrations NNPS 10_1101-2021_02_13_429885 328 9 and and CC 10_1101-2021_02_13_429885 328 10 Whole whole JJ 10_1101-2021_02_13_429885 328 11 - - HYPH 10_1101-2021_02_13_429885 328 12 Genome genome JJ 10_1101-2021_02_13_429885 328 13 Duplications Duplications NNPS 10_1101-2021_02_13_429885 328 14 in in IN 10_1101-2021_02_13_429885 328 15 Multi Multi NNP 10_1101-2021_02_13_429885 328 16 - - HYPH 10_1101-2021_02_13_429885 328 17 Sample Sample NNP 10_1101-2021_02_13_429885 328 18 Tumor Tumor NNP 10_1101-2021_02_13_429885 328 19 Sequencing Sequencing NNP 10_1101-2021_02_13_429885 328 20 Data Data NNPS 10_1101-2021_02_13_429885 328 21 . . . 10_1101-2021_02_13_429885 328 22 ” " '' 10_1101-2021_02_13_429885 328 23 Nature Nature NNP 10_1101-2021_02_13_429885 328 24 Communications​ Communications​ NNP 10_1101-2021_02_13_429885 328 25 11 11 CD 10_1101-2021_02_13_429885 328 26 ( ( -LRB- 10_1101-2021_02_13_429885 328 27 1 1 CD 10_1101-2021_02_13_429885 328 28 ) ) -RRB- 10_1101-2021_02_13_429885 328 29 : : : 10_1101-2021_02_13_429885 328 30 4301 4301 CD 10_1101-2021_02_13_429885 328 31 . . . 
10_1101-2021_02_13_429885 328 32 https://doi.org/10.1038/s41467-020-17967-y. https://doi.org/10.1038/s41467-020-17967-y. CD 10_1101-2021_02_13_429885 329 1 Data Data NNP 10_1101-2021_02_13_429885 329 2 Availability Availability NNP 10_1101-2021_02_13_429885 329 3 Multiregion multiregion NN 10_1101-2021_02_13_429885 329 4 colorectal colorectal JJ 10_1101-2021_02_13_429885 329 5 cancer cancer NN 10_1101-2021_02_13_429885 329 6 data datum NNS 10_1101-2021_02_13_429885 329 7 is be VBZ 10_1101-2021_02_13_429885 329 8 deposited deposit VBN 10_1101-2021_02_13_429885 329 9 in in IN 10_1101-2021_02_13_429885 329 10 EGA EGA NNP 10_1101-2021_02_13_429885 329 11 under under IN 10_1101-2021_02_13_429885 329 12 accession accession NN 10_1101-2021_02_13_429885 329 13 number number NN 10_1101-2021_02_13_429885 329 14 EGAS00001003066 EGAS00001003066 NNP 10_1101-2021_02_13_429885 329 15 . . . 10_1101-2021_02_13_429885 330 1 PCAWG pcawg NN 10_1101-2021_02_13_429885 330 2 calls call NNS 10_1101-2021_02_13_429885 330 3 are be VBP 10_1101-2021_02_13_429885 330 4 publicly publicly RB 10_1101-2021_02_13_429885 330 5 available available JJ 10_1101-2021_02_13_429885 330 6 at at IN 10_1101-2021_02_13_429885 330 7 ( ( -LRB- 10_1101-2021_02_13_429885 330 8 https://dcc.icgc.org/ https://dcc.icgc.org/ NNP 10_1101-2021_02_13_429885 330 9 ) ) -RRB- 10_1101-2021_02_13_429885 330 10 , , , 10_1101-2021_02_13_429885 330 11 the the DT 10_1101-2021_02_13_429885 330 12 ICGC ICGC NNP 10_1101-2021_02_13_429885 330 13 Data Data NNP 10_1101-2021_02_13_429885 330 14 Portal Portal NNP 10_1101-2021_02_13_429885 330 15 . . . 
10_1101-2021_02_13_429885 331 1 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 331 2 is be VBZ 10_1101-2021_02_13_429885 331 3 implemented implement VBN 10_1101-2021_02_13_429885 331 4 as as IN 10_1101-2021_02_13_429885 331 5 an an DT 10_1101-2021_02_13_429885 331 6 open open JJ 10_1101-2021_02_13_429885 331 7 source source NN 10_1101-2021_02_13_429885 331 8 R r NN 10_1101-2021_02_13_429885 331 9 package package NN 10_1101-2021_02_13_429885 331 10 that that WDT 10_1101-2021_02_13_429885 331 11 is be VBZ 10_1101-2021_02_13_429885 331 12 hosted host VBN 10_1101-2021_02_13_429885 331 13 at at IN 10_1101-2021_02_13_429885 331 14 the the DT 10_1101-2021_02_13_429885 331 15 GitHub GitHub NNP 10_1101-2021_02_13_429885 331 16 space space NN 10_1101-2021_02_13_429885 331 17 of of IN 10_1101-2021_02_13_429885 331 18 the the DT 10_1101-2021_02_13_429885 331 19 Caravagna Caravagna NNP 10_1101-2021_02_13_429885 331 20 Lab Lab NNP 10_1101-2021_02_13_429885 331 21 https://caravagnalab.github.io/CNAqc/. https://caravagnalab.github.io/cnaqc/. 
NN 10_1101-2021_02_13_429885 332 1 The the DT 10_1101-2021_02_13_429885 332 2 tool tool NN 10_1101-2021_02_13_429885 332 3 webpage webpage NN 10_1101-2021_02_13_429885 332 4 contains contain VBZ 10_1101-2021_02_13_429885 332 5 RMarkdown RMarkdown NNP 10_1101-2021_02_13_429885 332 6 tutorial tutorial JJ 10_1101-2021_02_13_429885 332 7 vignettes vignette NNS 10_1101-2021_02_13_429885 332 8 to to TO 10_1101-2021_02_13_429885 332 9 run run VB 10_1101-2021_02_13_429885 332 10 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 332 11 analysis analysis NN 10_1101-2021_02_13_429885 332 12 of of IN 10_1101-2021_02_13_429885 332 13 a a DT 10_1101-2021_02_13_429885 332 14 generic generic JJ 10_1101-2021_02_13_429885 332 15 dataset dataset NN 10_1101-2021_02_13_429885 332 16 , , , 10_1101-2021_02_13_429885 332 17 as as RB 10_1101-2021_02_13_429885 332 18 well well RB 10_1101-2021_02_13_429885 332 19 as as IN 10_1101-2021_02_13_429885 332 20 documents document NNS 10_1101-2021_02_13_429885 332 21 that that WDT 10_1101-2021_02_13_429885 332 22 explain explain VBP 10_1101-2021_02_13_429885 332 23 visualisation visualisation NN 10_1101-2021_02_13_429885 332 24 and and CC 10_1101-2021_02_13_429885 332 25 parameterizations parameterization NNS 10_1101-2021_02_13_429885 332 26 of of IN 10_1101-2021_02_13_429885 332 27 the the DT 10_1101-2021_02_13_429885 332 28 execution execution NN 10_1101-2021_02_13_429885 332 29 . . . 10_1101-2021_02_13_429885 333 1 All all DT 10_1101-2021_02_13_429885 333 2 analyses analysis NNS 10_1101-2021_02_13_429885 333 3 in in IN 10_1101-2021_02_13_429885 333 4 this this DT 10_1101-2021_02_13_429885 333 5 paper paper NN 10_1101-2021_02_13_429885 333 6 can can MD 10_1101-2021_02_13_429885 333 7 be be VB 10_1101-2021_02_13_429885 333 8 replicated replicate VBN 10_1101-2021_02_13_429885 333 9 following follow VBG 10_1101-2021_02_13_429885 333 10 the the DT 10_1101-2021_02_13_429885 333 11 vignettes vignette NNS 10_1101-2021_02_13_429885 333 12 . . . 
10_1101-2021_02_13_429885 334 1 Authors author NNS 10_1101-2021_02_13_429885 334 2 contribution contribution NN 10_1101-2021_02_13_429885 334 3 .CC .CC , 10_1101-2021_02_13_429885 334 4 - - : 10_1101-2021_02_13_429885 334 5 BY by IN 10_1101-2021_02_13_429885 334 6 - - HYPH 10_1101-2021_02_13_429885 334 7 NC NC NNP 10_1101-2021_02_13_429885 334 8 - - HYPH 10_1101-2021_02_13_429885 334 9 ND ND NNP 10_1101-2021_02_13_429885 334 10 4.0 4.0 CD 10_1101-2021_02_13_429885 334 11 International International NNP 10_1101-2021_02_13_429885 334 12 licenseavailable licenseavailable NN 10_1101-2021_02_13_429885 334 13 under under IN 10_1101-2021_02_13_429885 334 14 a a DT 10_1101-2021_02_13_429885 334 15 ( ( -LRB- 10_1101-2021_02_13_429885 334 16 which which WDT 10_1101-2021_02_13_429885 334 17 was be VBD 10_1101-2021_02_13_429885 334 18 not not RB 10_1101-2021_02_13_429885 334 19 certified certify VBN 10_1101-2021_02_13_429885 334 20 by by IN 10_1101-2021_02_13_429885 334 21 peer peer NN 10_1101-2021_02_13_429885 334 22 review review NN 10_1101-2021_02_13_429885 334 23 ) ) -RRB- 10_1101-2021_02_13_429885 334 24 is be VBZ 10_1101-2021_02_13_429885 334 25 the the DT 10_1101-2021_02_13_429885 334 26 author author NN 10_1101-2021_02_13_429885 334 27 / / SYM 10_1101-2021_02_13_429885 334 28 funder funder NN 10_1101-2021_02_13_429885 334 29 , , , 10_1101-2021_02_13_429885 334 30 who who WP 10_1101-2021_02_13_429885 334 31 has have VBZ 10_1101-2021_02_13_429885 334 32 granted grant VBN 10_1101-2021_02_13_429885 334 33 bioRxiv biorxiv IN 10_1101-2021_02_13_429885 334 34 a a DT 10_1101-2021_02_13_429885 334 35 license license NN 10_1101-2021_02_13_429885 334 36 to to TO 10_1101-2021_02_13_429885 334 37 display display VB 10_1101-2021_02_13_429885 334 38 the the DT 10_1101-2021_02_13_429885 334 39 preprint preprint NN 10_1101-2021_02_13_429885 334 40 in in IN 10_1101-2021_02_13_429885 334 41 perpetuity perpetuity NN 10_1101-2021_02_13_429885 334 42 . . . 
10_1101-2021_02_13_429885 335 1 It -PRON- PRP 10_1101-2021_02_13_429885 335 2 is be VBZ 10_1101-2021_02_13_429885 335 3 made make VBN 10_1101-2021_02_13_429885 335 4 The the DT 10_1101-2021_02_13_429885 335 5 copyright copyright NN 10_1101-2021_02_13_429885 335 6 holder holder NN 10_1101-2021_02_13_429885 335 7 for for IN 10_1101-2021_02_13_429885 335 8 this this DT 10_1101-2021_02_13_429885 335 9 preprintthis preprintthis NN 10_1101-2021_02_13_429885 335 10 version version NN 10_1101-2021_02_13_429885 335 11 posted post VBD 10_1101-2021_02_13_429885 335 12 February February NNP 10_1101-2021_02_13_429885 335 13 13 13 CD 10_1101-2021_02_13_429885 335 14 , , , 10_1101-2021_02_13_429885 335 15 2021 2021 CD 10_1101-2021_02_13_429885 335 16 . . . 10_1101-2021_02_13_429885 335 17 ; ; : 10_1101-2021_02_13_429885 335 18 https://doi.org/10.1101/2021.02.13.429885doi https://doi.org/10.1101/2021.02.13.429885doi ADD 10_1101-2021_02_13_429885 335 19 : : : 10_1101-2021_02_13_429885 335 20 bioRxiv biorxiv VB 10_1101-2021_02_13_429885 335 21 preprint preprint NNP 
10_1101-2021_02_13_429885 337 1 All all DT 10_1101-2021_02_13_429885 337 2 authors author NNS 10_1101-2021_02_13_429885 337 3 conceived conceive VBD 10_1101-2021_02_13_429885 337 4 the the DT 10_1101-2021_02_13_429885 337 5 method method NN 10_1101-2021_02_13_429885 337 6 , , , 10_1101-2021_02_13_429885 337 7 which which WDT 10_1101-2021_02_13_429885 337 8 GC GC NNP 10_1101-2021_02_13_429885 337 9 formalised formalise VBD 10_1101-2021_02_13_429885 337 10 and and CC 10_1101-2021_02_13_429885 337 11 implemented implement VBD 10_1101-2021_02_13_429885 337 12 . . . 10_1101-2021_02_13_429885 338 1 All all DT 10_1101-2021_02_13_429885 338 2 authors author NNS 10_1101-2021_02_13_429885 338 3 analysed analyse VBD 10_1101-2021_02_13_429885 338 4 the the DT 10_1101-2021_02_13_429885 338 5 data datum NNS 10_1101-2021_02_13_429885 338 6 and and CC 10_1101-2021_02_13_429885 338 7 wrote write VBD 10_1101-2021_02_13_429885 338 8 the the DT 10_1101-2021_02_13_429885 338 9 manuscript manuscript NN 10_1101-2021_02_13_429885 338 10 . . . 10_1101-2021_02_13_429885 339 1 Competing compete VBG 10_1101-2021_02_13_429885 339 2 interests interest NNS 10_1101-2021_02_13_429885 339 3 . . . 10_1101-2021_02_13_429885 340 1 The the DT 10_1101-2021_02_13_429885 340 2 authors author NNS 10_1101-2021_02_13_429885 340 3 declare declare VBP 10_1101-2021_02_13_429885 340 4 no no DT 10_1101-2021_02_13_429885 340 5 competing compete VBG 10_1101-2021_02_13_429885 340 6 interests interest NNS 10_1101-2021_02_13_429885 340 7 . . . 
10_1101-2021_02_13_429885 341 1 Online online JJ 10_1101-2021_02_13_429885 341 2 methods method NNS 10_1101-2021_02_13_429885 341 3 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 341 4 supports support VBZ 10_1101-2021_02_13_429885 341 5 two two CD 10_1101-2021_02_13_429885 341 6 human human JJ 10_1101-2021_02_13_429885 341 7 genome genome JJ 10_1101-2021_02_13_429885 341 8 references reference NNS 10_1101-2021_02_13_429885 341 9 ( ( -LRB- 10_1101-2021_02_13_429885 341 10 GRCh38 GRCh38 NNP 10_1101-2021_02_13_429885 341 11 and and CC 10_1101-2021_02_13_429885 341 12 hg19 hg19 NNP 10_1101-2021_02_13_429885 341 13 ) ) -RRB- 10_1101-2021_02_13_429885 341 14 , , , 10_1101-2021_02_13_429885 341 15 and and CC 10_1101-2021_02_13_429885 341 16 the the DT 10_1101-2021_02_13_429885 341 17 most most RBS 10_1101-2021_02_13_429885 341 18 common common JJ 10_1101-2021_02_13_429885 341 19 CNA CNA NNP 10_1101-2021_02_13_429885 341 20 profiles profile NNS 10_1101-2021_02_13_429885 341 21 found find VBN 10_1101-2021_02_13_429885 341 22 in in IN 10_1101-2021_02_13_429885 341 23 cancers cancer NNS 10_1101-2021_02_13_429885 341 24 : : : 10_1101-2021_02_13_429885 341 25 ● ● . 
10_1101-2021_02_13_429885 341 26 heterozygous heterozygous JJ 10_1101-2021_02_13_429885 341 27 diploid diploid NNP 10_1101-2021_02_13_429885 341 28 states state NNS 10_1101-2021_02_13_429885 341 29 ( ( -LRB- 10_1101-2021_02_13_429885 341 30 1:1 1:1 CD 10_1101-2021_02_13_429885 341 31 ) ) -RRB- 10_1101-2021_02_13_429885 341 32 ; ; : 10_1101-2021_02_13_429885 341 33 2 2 LS 10_1101-2021_02_13_429885 341 34 ● ● : 10_1101-2021_02_13_429885 341 35 loss loss NN 10_1101-2021_02_13_429885 341 36 of of IN 10_1101-2021_02_13_429885 341 37 heterozygosity heterozygosity NN 10_1101-2021_02_13_429885 341 38 ( ( -LRB- 10_1101-2021_02_13_429885 341 39 LOH LOH NNP 10_1101-2021_02_13_429885 341 40 ) ) -RRB- 10_1101-2021_02_13_429885 341 41 in in IN 10_1101-2021_02_13_429885 341 42 monosomy monosomy NNP 10_1101-2021_02_13_429885 341 43 ( ( -LRB- 10_1101-2021_02_13_429885 341 44 1:0 1:0 CD 10_1101-2021_02_13_429885 341 45 ) ) -RRB- 10_1101-2021_02_13_429885 341 46 and and CC 10_1101-2021_02_13_429885 341 47 copy copy NN 10_1101-2021_02_13_429885 341 48 - - HYPH 10_1101-2021_02_13_429885 341 49 neutral neutral JJ 10_1101-2021_02_13_429885 341 50 ( ( -LRB- 10_1101-2021_02_13_429885 341 51 2:0 2:0 CD 10_1101-2021_02_13_429885 341 52 ) ) -RRB- 10_1101-2021_02_13_429885 341 53 states state NNS 10_1101-2021_02_13_429885 341 54 ; ; : 10_1101-2021_02_13_429885 341 55 ● ● NFP 10_1101-2021_02_13_429885 341 56 triploid triploid NNP 10_1101-2021_02_13_429885 341 57 ( ( -LRB- 10_1101-2021_02_13_429885 341 58 AAB AAB NNP 10_1101-2021_02_13_429885 341 59 or or CC 10_1101-2021_02_13_429885 341 60 2:1 2:1 CD 10_1101-2021_02_13_429885 341 61 ) ) -RRB- 10_1101-2021_02_13_429885 341 62 or or CC 10_1101-2021_02_13_429885 341 63 tetraploid tetraploid NN 10_1101-2021_02_13_429885 341 64 ( ( -LRB- 10_1101-2021_02_13_429885 341 65 AABB AABB NNP 10_1101-2021_02_13_429885 341 66 or or CC 10_1101-2021_02_13_429885 341 67 2:2 2:2 CD 10_1101-2021_02_13_429885 341 68 ) ) -RRB- 10_1101-2021_02_13_429885 341 69 states 
state NNS 10_1101-2021_02_13_429885 341 70 . . . 10_1101-2021_02_13_429885 342 1 We -PRON- PRP 10_1101-2021_02_13_429885 342 2 make make VBP 10_1101-2021_02_13_429885 342 3 a a DT 10_1101-2021_02_13_429885 342 4 simplifying simplify VBG 10_1101-2021_02_13_429885 342 5 assumption assumption NN 10_1101-2021_02_13_429885 342 6 , , , 10_1101-2021_02_13_429885 342 7 whereby whereby WRB 10_1101-2021_02_13_429885 342 8 CNAs cna NNS 10_1101-2021_02_13_429885 342 9 have have VBP 10_1101-2021_02_13_429885 342 10 been be VBN 10_1101-2021_02_13_429885 342 11 acquired acquire VBN 10_1101-2021_02_13_429885 342 12 in in IN 10_1101-2021_02_13_429885 342 13 one one CD 10_1101-2021_02_13_429885 342 14 step step NN 10_1101-2021_02_13_429885 342 15 , , , 10_1101-2021_02_13_429885 342 16 starting start VBG 10_1101-2021_02_13_429885 342 17 from from IN 10_1101-2021_02_13_429885 342 18 a a DT 10_1101-2021_02_13_429885 342 19 simple simple JJ 10_1101-2021_02_13_429885 342 20 heterozygous heterozygous JJ 10_1101-2021_02_13_429885 342 21 diploid diploid NN 10_1101-2021_02_13_429885 342 22 state state NN 10_1101-2021_02_13_429885 342 23 ( ( -LRB- 10_1101-2021_02_13_429885 342 24 the the DT 10_1101-2021_02_13_429885 342 25 germline germline NN 10_1101-2021_02_13_429885 342 26 ) ) -RRB- 10_1101-2021_02_13_429885 342 27 . . . 
10_1101-2021_02_13_429885 343 1 For for IN 10_1101-2021_02_13_429885 343 2 this this DT 10_1101-2021_02_13_429885 343 3 reason reason NN 10_1101-2021_02_13_429885 343 4 , , , 10_1101-2021_02_13_429885 343 5 for for IN 10_1101-2021_02_13_429885 343 6 tetraploid tetraploid NN 10_1101-2021_02_13_429885 343 7 segments segment NNS 10_1101-2021_02_13_429885 343 8 we -PRON- PRP 10_1101-2021_02_13_429885 343 9 only only RB 10_1101-2021_02_13_429885 343 10 consider consider VBP 10_1101-2021_02_13_429885 343 11 copy copy NN 10_1101-2021_02_13_429885 343 12 state state NN 10_1101-2021_02_13_429885 343 13 2:2 2:2 CD 10_1101-2021_02_13_429885 343 14 , , , 10_1101-2021_02_13_429885 343 15 instead instead RB 10_1101-2021_02_13_429885 343 16 of of IN 10_1101-2021_02_13_429885 343 17 3:1 3:1 CD 10_1101-2021_02_13_429885 343 18 or or CC 10_1101-2021_02_13_429885 343 19 4:0 4:0 CD 10_1101-2021_02_13_429885 343 20 . . . 10_1101-2021_02_13_429885 344 1 This this DT 10_1101-2021_02_13_429885 344 2 allows allow VBZ 10_1101-2021_02_13_429885 344 3 us -PRON- PRP 10_1101-2021_02_13_429885 344 4 to to TO 10_1101-2021_02_13_429885 344 5 make make VB 10_1101-2021_02_13_429885 344 6 simpler simple JJR 10_1101-2021_02_13_429885 344 7 computations computation NNS 10_1101-2021_02_13_429885 344 8 . . . 
10_1101-2021_02_13_429885 345 1 In in IN 10_1101-2021_02_13_429885 345 2 practice practice NN 10_1101-2021_02_13_429885 345 3 , , , 10_1101-2021_02_13_429885 345 4 we -PRON- PRP 10_1101-2021_02_13_429885 345 5 avoid avoid VBP 10_1101-2021_02_13_429885 345 6 working work VBG 10_1101-2021_02_13_429885 345 7 with with IN 10_1101-2021_02_13_429885 345 8 copy copy NN 10_1101-2021_02_13_429885 345 9 states state NNS 10_1101-2021_02_13_429885 345 10 for for IN 10_1101-2021_02_13_429885 345 11 which which WDT 10_1101-2021_02_13_429885 345 12 the the DT 10_1101-2021_02_13_429885 345 13 computation computation NN 10_1101-2021_02_13_429885 345 14 of of IN 10_1101-2021_02_13_429885 345 15 CCFs ccf NNS 10_1101-2021_02_13_429885 345 16 is be VBZ 10_1101-2021_02_13_429885 345 17 very very RB 10_1101-2021_02_13_429885 345 18 difficult difficult JJ 10_1101-2021_02_13_429885 345 19 , , , 10_1101-2021_02_13_429885 345 20 and and CC 10_1101-2021_02_13_429885 345 21 that that WDT 10_1101-2021_02_13_429885 345 22 are be VBP 10_1101-2021_02_13_429885 345 23 quite quite RB 10_1101-2021_02_13_429885 345 24 unlikely unlikely JJ 10_1101-2021_02_13_429885 345 25 to to TO 10_1101-2021_02_13_429885 345 26 be be VB 10_1101-2021_02_13_429885 345 27 observed observe VBN 10_1101-2021_02_13_429885 345 28 in in IN 10_1101-2021_02_13_429885 345 29 real real JJ 10_1101-2021_02_13_429885 345 30 data datum NNS 10_1101-2021_02_13_429885 345 31 . . . 10_1101-2021_02_13_429885 346 1 Also also RB 10_1101-2021_02_13_429885 346 2 , , , 10_1101-2021_02_13_429885 346 3 we -PRON- PRP 10_1101-2021_02_13_429885 346 4 consider consider VBP 10_1101-2021_02_13_429885 346 5 only only RB 10_1101-2021_02_13_429885 346 6 clonal clonal JJ 10_1101-2021_02_13_429885 346 7 CNA cna NN 10_1101-2021_02_13_429885 346 8 segments segment NNS 10_1101-2021_02_13_429885 346 9 . . . 
10_1101-2021_02_13_429885 347 1 While while IN 10_1101-2021_02_13_429885 347 2 subclonal subclonal JJ 10_1101-2021_02_13_429885 347 3 CNA CNA NNP 10_1101-2021_02_13_429885 347 4 segments segment NNS 10_1101-2021_02_13_429885 347 5 are be VBP 10_1101-2021_02_13_429885 347 6 certainly certainly RB 10_1101-2021_02_13_429885 347 7 important important JJ 10_1101-2021_02_13_429885 347 8 for for IN 10_1101-2021_02_13_429885 347 9 cancer cancer NN 10_1101-2021_02_13_429885 347 10 genomics genomic NNS 10_1101-2021_02_13_429885 347 11 , , , 10_1101-2021_02_13_429885 347 12 the the DT 10_1101-2021_02_13_429885 347 13 calls call NNS 10_1101-2021_02_13_429885 347 14 that that WDT 10_1101-2021_02_13_429885 347 15 we -PRON- PRP 10_1101-2021_02_13_429885 347 16 seek seek VBP 10_1101-2021_02_13_429885 347 17 to to TO 10_1101-2021_02_13_429885 347 18 quality quality NN 10_1101-2021_02_13_429885 347 19 check check NN 10_1101-2021_02_13_429885 347 20 regard regard NN 10_1101-2021_02_13_429885 347 21 just just RB 10_1101-2021_02_13_429885 347 22 clonal clonal JJ 10_1101-2021_02_13_429885 347 23 CNA CNA NNP 10_1101-2021_02_13_429885 347 24 events event NNS 10_1101-2021_02_13_429885 347 25 ; ; : 10_1101-2021_02_13_429885 347 26 being be VBG 10_1101-2021_02_13_429885 347 27 the the DT 10_1101-2021_02_13_429885 347 28 one one NN 10_1101-2021_02_13_429885 347 29 most most RBS 10_1101-2021_02_13_429885 347 30 prevalent prevalent JJ 10_1101-2021_02_13_429885 347 31 in in IN 10_1101-2021_02_13_429885 347 32 the the DT 10_1101-2021_02_13_429885 347 33 majority majority NN 10_1101-2021_02_13_429885 347 34 of of IN 10_1101-2021_02_13_429885 347 35 cancer cancer NN 10_1101-2021_02_13_429885 347 36 cells cell NNS 10_1101-2021_02_13_429885 347 37 , , , 10_1101-2021_02_13_429885 347 38 they -PRON- PRP 10_1101-2021_02_13_429885 347 39 have have VBP 10_1101-2021_02_13_429885 347 40 to to TO 10_1101-2021_02_13_429885 347 41 be be VB 10_1101-2021_02_13_429885 347 42 prioritised prioritise VBN 
10_1101-2021_02_13_429885 347 43 , , , 10_1101-2021_02_13_429885 347 44 with with IN 10_1101-2021_02_13_429885 347 45 subclonal subclonal JJ 10_1101-2021_02_13_429885 347 46 CNAs cna NNS 10_1101-2021_02_13_429885 347 47 being be VBG 10_1101-2021_02_13_429885 347 48 only only RB 10_1101-2021_02_13_429885 347 49 reliable reliable JJ 10_1101-2021_02_13_429885 347 50 for for IN 10_1101-2021_02_13_429885 347 51 tumours tumour NNS 10_1101-2021_02_13_429885 347 52 with with IN 10_1101-2021_02_13_429885 347 53 good good JJ 10_1101-2021_02_13_429885 347 54 clonal clonal JJ 10_1101-2021_02_13_429885 347 55 CNA CNA NNP 10_1101-2021_02_13_429885 347 56 calls call NNS 10_1101-2021_02_13_429885 347 57 . . . 10_1101-2021_02_13_429885 348 1 CNAqc cnaqc NN 10_1101-2021_02_13_429885 348 2 works work VBZ 10_1101-2021_02_13_429885 348 3 primarily primarily RB 10_1101-2021_02_13_429885 348 4 with with IN 10_1101-2021_02_13_429885 348 5 Whole whole JJ 10_1101-2021_02_13_429885 348 6 - - HYPH 10_1101-2021_02_13_429885 348 7 Genome genome NN 10_1101-2021_02_13_429885 348 8 Sequencing sequencing NN 10_1101-2021_02_13_429885 348 9 ( ( -LRB- 10_1101-2021_02_13_429885 348 10 WGS WGS NNP 10_1101-2021_02_13_429885 348 11 ) ) -RRB- 10_1101-2021_02_13_429885 348 12 data datum NNS 10_1101-2021_02_13_429885 348 13 . . . 
10_1101-2021_02_13_429885 349 1 For for IN 10_1101-2021_02_13_429885 349 2 exome exome JJ 10_1101-2021_02_13_429885 349 3 data datum NNS 10_1101-2021_02_13_429885 349 4 , , , 10_1101-2021_02_13_429885 349 5 the the DT 10_1101-2021_02_13_429885 349 6 reduced reduce VBN 10_1101-2021_02_13_429885 349 7 exonic exonic JJ 10_1101-2021_02_13_429885 349 8 mutation mutation NN 10_1101-2021_02_13_429885 349 9 burden burden NN 10_1101-2021_02_13_429885 349 10 can can MD 10_1101-2021_02_13_429885 349 11 make make VB 10_1101-2021_02_13_429885 349 12 it -PRON- PRP 10_1101-2021_02_13_429885 349 13 more more RBR 10_1101-2021_02_13_429885 349 14 difficult difficult JJ 10_1101-2021_02_13_429885 349 15 to to TO 10_1101-2021_02_13_429885 349 16 work work VB 10_1101-2021_02_13_429885 349 17 with with IN 10_1101-2021_02_13_429885 349 18 the the DT 10_1101-2021_02_13_429885 349 19 spectrum spectrum NN 10_1101-2021_02_13_429885 349 20 of of IN 10_1101-2021_02_13_429885 349 21 the the DT 10_1101-2021_02_13_429885 349 22 VAF VAF NNP 10_1101-2021_02_13_429885 349 23 distribution distribution NN 10_1101-2021_02_13_429885 349 24 . . . 
10_1101-2021_02_13_429885 350 1 In in IN 10_1101-2021_02_13_429885 350 2 general general JJ 10_1101-2021_02_13_429885 350 3 , , , 10_1101-2021_02_13_429885 350 4 the the DT 10_1101-2021_02_13_429885 350 5 key key JJ 10_1101-2021_02_13_429885 350 6 determinant determinant NN 10_1101-2021_02_13_429885 350 7 to to TO 10_1101-2021_02_13_429885 350 8 detect detect VB 10_1101-2021_02_13_429885 350 9 peaks peak NNS 10_1101-2021_02_13_429885 350 10 in in IN 10_1101-2021_02_13_429885 350 11 the the DT 10_1101-2021_02_13_429885 350 12 VAF VAF NNP 10_1101-2021_02_13_429885 350 13 , , , 10_1101-2021_02_13_429885 350 14 is be VBZ 10_1101-2021_02_13_429885 350 15 the the DT 10_1101-2021_02_13_429885 350 16 number number NN 10_1101-2021_02_13_429885 350 17 of of IN 10_1101-2021_02_13_429885 350 18 mutations mutation NNS 10_1101-2021_02_13_429885 350 19 per per IN 10_1101-2021_02_13_429885 350 20 copy copy NN 10_1101-2021_02_13_429885 350 21 state state NN 10_1101-2021_02_13_429885 350 22 . . . 10_1101-2021_02_13_429885 351 1 For for IN 10_1101-2021_02_13_429885 351 2 tumours tumour NNS 10_1101-2021_02_13_429885 351 3 with with IN 10_1101-2021_02_13_429885 351 4 strong strong JJ 10_1101-2021_02_13_429885 351 5 endogenous endogenous JJ 10_1101-2021_02_13_429885 351 6 mutant mutant NN 10_1101-2021_02_13_429885 351 7 factors factor NNS 10_1101-2021_02_13_429885 351 8 ( ( -LRB- 10_1101-2021_02_13_429885 351 9 e.g. e.g. RB 10_1101-2021_02_13_429885 351 10 , , , 10_1101-2021_02_13_429885 351 11 smoking smoke VBG 10_1101-2021_02_13_429885 351 12 ) ) -RRB- 10_1101-2021_02_13_429885 351 13 or or CC 10_1101-2021_02_13_429885 351 14 very very RB 10_1101-2021_02_13_429885 351 15 high high JJ 10_1101-2021_02_13_429885 351 16 mutation mutation NN 10_1101-2021_02_13_429885 351 17 rate rate NN 10_1101-2021_02_13_429885 351 18 ( ( -LRB- 10_1101-2021_02_13_429885 351 19 e.g. e.g. 
RB 10_1101-2021_02_13_429885 351 20 , , , 10_1101-2021_02_13_429885 351 21 microsatellite microsatellite NNP 10_1101-2021_02_13_429885 351 22 unstable unstable JJ 10_1101-2021_02_13_429885 351 23 tumours tumour NNS 10_1101-2021_02_13_429885 351 24 ) ) -RRB- 10_1101-2021_02_13_429885 351 25 , , , 10_1101-2021_02_13_429885 351 26 the the DT 10_1101-2021_02_13_429885 351 27 number number NN 10_1101-2021_02_13_429885 351 28 of of IN 10_1101-2021_02_13_429885 351 29 exonic exonic JJ 10_1101-2021_02_13_429885 351 30 mutations mutation NNS 10_1101-2021_02_13_429885 351 31 could could MD 10_1101-2021_02_13_429885 351 32 be be VB 10_1101-2021_02_13_429885 351 33 high high JJ 10_1101-2021_02_13_429885 351 34 enough enough RB 10_1101-2021_02_13_429885 351 35 to to TO 10_1101-2021_02_13_429885 351 36 use use VB 10_1101-2021_02_13_429885 351 37 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 351 38 . . . 10_1101-2021_02_13_429885 352 1 Peak peak NN 10_1101-2021_02_13_429885 352 2 - - HYPH 10_1101-2021_02_13_429885 352 3 detection detection NN 10_1101-2021_02_13_429885 352 4 QC qc NN 10_1101-2021_02_13_429885 352 5 2 2 CD 10_1101-2021_02_13_429885 352 6 The the DT 10_1101-2021_02_13_429885 352 7 notation notation NN 10_1101-2021_02_13_429885 352 8 1:1 1:1 CD 10_1101-2021_02_13_429885 352 9 is be VBZ 10_1101-2021_02_13_429885 352 10 sometimes sometimes RB 10_1101-2021_02_13_429885 352 11 analogously analogously RB 10_1101-2021_02_13_429885 352 12 expressed express VBN 10_1101-2021_02_13_429885 352 13 as as IN 10_1101-2021_02_13_429885 352 14 genotype genotype NN 10_1101-2021_02_13_429885 352 15 AB AB NNP 10_1101-2021_02_13_429885 352 16 , , , 10_1101-2021_02_13_429885 352 17 1:0 1:0 CD 10_1101-2021_02_13_429885 352 18 as as IN 10_1101-2021_02_13_429885 352 19 A a NN 10_1101-2021_02_13_429885 352 20 , , , 10_1101-2021_02_13_429885 352 21 2:1 2:1 CD 10_1101-2021_02_13_429885 352 22 as as IN 10_1101-2021_02_13_429885 352 23 AAB AAB NNP 10_1101-2021_02_13_429885 352 24 and and CC 
10_1101-2021_02_13_429885 352 25 2:2 2:2 CD 10_1101-2021_02_13_429885 352 26 as as IN 10_1101-2021_02_13_429885 352 27 AABB AABB NNP 10_1101-2021_02_13_429885 352 28 . . . 10_1101-2021_02_13_429885 353 1 .CC .CC NFP 10_1101-2021_02_13_429885 353 2 - - : 10_1101-2021_02_13_429885 353 3 BY by IN 10_1101-2021_02_13_429885 353 4 - - HYPH 10_1101-2021_02_13_429885 353 5 NC NC NNP 10_1101-2021_02_13_429885 353 6 - - HYPH 10_1101-2021_02_13_429885 353 7 ND ND NNP 10_1101-2021_02_13_429885 353 8 4.0 4.0 CD 10_1101-2021_02_13_429885 353 9 International International NNP 10_1101-2021_02_13_429885 353 10 licenseavailable licenseavailable NN 10_1101-2021_02_13_429885 353 11 under under IN 10_1101-2021_02_13_429885 353 12 a a DT 10_1101-2021_02_13_429885 353 13 ( ( -LRB- 10_1101-2021_02_13_429885 353 14 which which WDT 10_1101-2021_02_13_429885 353 15 was be VBD 10_1101-2021_02_13_429885 353 16 not not RB 10_1101-2021_02_13_429885 353 17 certified certify VBN 10_1101-2021_02_13_429885 353 18 by by IN 10_1101-2021_02_13_429885 353 19 peer peer NN 10_1101-2021_02_13_429885 353 20 review review NN 10_1101-2021_02_13_429885 353 21 ) ) -RRB- 10_1101-2021_02_13_429885 353 22 is be VBZ 10_1101-2021_02_13_429885 353 23 the the DT 10_1101-2021_02_13_429885 353 24 author author NN 10_1101-2021_02_13_429885 353 25 / / SYM 10_1101-2021_02_13_429885 353 26 funder funder NN 10_1101-2021_02_13_429885 353 27 , , , 10_1101-2021_02_13_429885 353 28 who who WP 10_1101-2021_02_13_429885 353 29 has have VBZ 10_1101-2021_02_13_429885 353 30 granted grant VBN 10_1101-2021_02_13_429885 353 31 bioRxiv biorxiv IN 10_1101-2021_02_13_429885 353 32 a a DT 10_1101-2021_02_13_429885 353 33 license license NN 10_1101-2021_02_13_429885 353 34 to to TO 10_1101-2021_02_13_429885 353 35 display display VB 10_1101-2021_02_13_429885 353 36 the the DT 10_1101-2021_02_13_429885 353 37 preprint preprint NN 10_1101-2021_02_13_429885 353 38 in in IN 10_1101-2021_02_13_429885 353 39 perpetuity perpetuity NN 
10_1101-2021_02_13_429885 353 40 . . . 10_1101-2021_02_13_429885 354 1 It -PRON- PRP 10_1101-2021_02_13_429885 354 2 is be VBZ 10_1101-2021_02_13_429885 354 3 made make VBN 10_1101-2021_02_13_429885 354 4 The the DT 10_1101-2021_02_13_429885 354 5 copyright copyright NN 10_1101-2021_02_13_429885 354 6 holder holder NN 10_1101-2021_02_13_429885 354 7 for for IN 10_1101-2021_02_13_429885 354 8 this this DT 10_1101-2021_02_13_429885 354 9 preprintthis preprintthis NN 10_1101-2021_02_13_429885 354 10 version version NN 10_1101-2021_02_13_429885 354 11 posted post VBD 10_1101-2021_02_13_429885 354 12 February February NNP 10_1101-2021_02_13_429885 354 13 13 13 CD 10_1101-2021_02_13_429885 354 14 , , , 10_1101-2021_02_13_429885 354 15 2021 2021 CD 10_1101-2021_02_13_429885 354 16 . . . 10_1101-2021_02_13_429885 354 17 ; ; : 10_1101-2021_02_13_429885 354 18 https://doi.org/10.1101/2021.02.13.429885doi https://doi.org/10.1101/2021.02.13.429885doi ADD 10_1101-2021_02_13_429885 354 19 : : : 10_1101-2021_02_13_429885 354 20 bioRxiv biorxiv VB 10_1101-2021_02_13_429885 354 21 preprint preprint NN 10_1101-2021_02_13_429885 354 22 https://doi.org/10.1101/2021.02.13.429885 https://doi.org/10.1101/2021.02.13.429885 ADD 10_1101-2021_02_13_429885 354 23 http://creativecommons.org/licenses/by-nc-nd/4.0/ http://creativecommons.org/licenses/by-nc-nd/4.0/ CD 10_1101-2021_02_13_429885 354 24 Househam Househam NNP 10_1101-2021_02_13_429885 354 25 et et FW 10_1101-2021_02_13_429885 354 26 al al NNP 10_1101-2021_02_13_429885 354 27 . . . 
10_1101-2021_02_13_429885 356 1 We -PRON- PRP 10_1101-2021_02_13_429885 356 2 consider consider VBP 10_1101-2021_02_13_429885 356 3 a a DT 10_1101-2021_02_13_429885 356 4 somatic somatic JJ 10_1101-2021_02_13_429885 356 5 mutation mutation NN 10_1101-2021_02_13_429885 356 6 ​present ​present CD 10_1101-2021_02_13_429885 356 7 in in IN 10_1101-2021_02_13_429885 356 8 𝑚 𝑚 DT 10_1101-2021_02_13_429885 356 9 copies copy NNS 10_1101-2021_02_13_429885 356 10 of of IN 10_1101-2021_02_13_429885 356 11 the the DT 10_1101-2021_02_13_429885 356 12 tumour tumour NN 10_1101-2021_02_13_429885 356 13 genome genome NN 10_1101-2021_02_13_429885 356 14 , , , 10_1101-2021_02_13_429885 356 15 when when WRB 10_1101-2021_02_13_429885 356 16 the the DT 10_1101-2021_02_13_429885 356 17 sample sample NN 10_1101-2021_02_13_429885 356 18 purity purity NN 10_1101-2021_02_13_429885 356 19 is be VBZ 10_1101-2021_02_13_429885 356 20 𝜋 𝜋 NN 10_1101-2021_02_13_429885 356 21 and and CC 10_1101-2021_02_13_429885 356 22 
the the DT 10_1101-2021_02_13_429885 356 23 segment segment NN 10_1101-2021_02_13_429885 356 24 ploidy ploidy NN 10_1101-2021_02_13_429885 356 25 is be VBZ 10_1101-2021_02_13_429885 356 26 𝑝. 𝑝. NN 10_1101-2021_02_13_429885 357 1 Note note VB 10_1101-2021_02_13_429885 357 2 that that DT 10_1101-2021_02_13_429885 357 3 can can MD 10_1101-2021_02_13_429885 357 4 be be VB 10_1101-2021_02_13_429885 357 5 computed compute VBN 10_1101-2021_02_13_429885 357 6 summing sum VBG 10_1101-2021_02_13_429885 357 7 p p NN 10_1101-2021_02_13_429885 357 8 the the DT 10_1101-2021_02_13_429885 357 9 total total JJ 10_1101-2021_02_13_429885 357 10 number number NN 10_1101-2021_02_13_429885 357 11 of of IN 10_1101-2021_02_13_429885 357 12 copies copy NNS 10_1101-2021_02_13_429885 357 13 of of IN 10_1101-2021_02_13_429885 357 14 the the DT 10_1101-2021_02_13_429885 357 15 minor minor JJ 10_1101-2021_02_13_429885 357 16 and and CC 10_1101-2021_02_13_429885 357 17 major major JJ 10_1101-2021_02_13_429885 357 18 allele allele NNS 10_1101-2021_02_13_429885 357 19 at at IN 10_1101-2021_02_13_429885 357 20 the the DT 10_1101-2021_02_13_429885 357 21 mutation mutation NN 10_1101-2021_02_13_429885 357 22 locus locus NN 10_1101-2021_02_13_429885 357 23 ( ( -LRB- 10_1101-2021_02_13_429885 357 24 ​Figure ​figure NN 10_1101-2021_02_13_429885 357 25 1 1 CD 10_1101-2021_02_13_429885 357 26 ​ ​ UH 10_1101-2021_02_13_429885 357 27 ) ) -RRB- 10_1101-2021_02_13_429885 357 28 . . . 
10_1101-2021_02_13_429885 358 1 The the DT 10_1101-2021_02_13_429885 358 2 key key JJ 10_1101-2021_02_13_429885 358 3 equations equation NNS 10_1101-2021_02_13_429885 358 4 for for IN 10_1101-2021_02_13_429885 358 5 the the DT 10_1101-2021_02_13_429885 358 6 expected expect VBN 10_1101-2021_02_13_429885 358 7 VAF VAF NNP 10_1101-2021_02_13_429885 358 8 of of IN 10_1101-2021_02_13_429885 358 9 a a DT 10_1101-2021_02_13_429885 358 10 clonal clonal JJ 10_1101-2021_02_13_429885 358 11 mutation mutation NN 10_1101-2021_02_13_429885 358 12 and and CC 10_1101-2021_02_13_429885 358 13 its -PRON- PRP$ 10_1101-2021_02_13_429885 358 14 CCF ccf NN 10_1101-2021_02_13_429885 358 15 are be VBP 10_1101-2021_02_13_429885 358 16 presented present VBN 10_1101-2021_02_13_429885 358 17 in in IN 10_1101-2021_02_13_429885 358 18 the the DT 10_1101-2021_02_13_429885 358 19 Main Main NNP 10_1101-2021_02_13_429885 358 20 Text Text NNP 10_1101-2021_02_13_429885 358 21 . . . 10_1101-2021_02_13_429885 359 1 Here here RB 10_1101-2021_02_13_429885 359 2 we -PRON- PRP 10_1101-2021_02_13_429885 359 3 discuss discuss VBP 10_1101-2021_02_13_429885 359 4 how how WRB 10_1101-2021_02_13_429885 359 5 ​peaks ​peak NNS 10_1101-2021_02_13_429885 359 6 can can MD 10_1101-2021_02_13_429885 359 7 be be VB 10_1101-2021_02_13_429885 359 8 used use VBN 10_1101-2021_02_13_429885 359 9 to to IN 10_1101-2021_02_13_429885 359 10 QC QC NNP 10_1101-2021_02_13_429885 359 11 both both CC 10_1101-2021_02_13_429885 359 12 tumour tumour NN 10_1101-2021_02_13_429885 359 13 purity purity NN 10_1101-2021_02_13_429885 359 14 and and CC 10_1101-2021_02_13_429885 359 15 CNA CNA NNP 10_1101-2021_02_13_429885 359 16 segments segment NNS 10_1101-2021_02_13_429885 359 17 and and CC 10_1101-2021_02_13_429885 359 18 , , , 10_1101-2021_02_13_429885 359 19 consequently consequently RB 10_1101-2021_02_13_429885 359 20 , , , 10_1101-2021_02_13_429885 359 21 overall overall JJ 10_1101-2021_02_13_429885 359 22 tumour tumour NN 
10_1101-2021_02_13_429885 359 23 ploidy ploidy NN 10_1101-2021_02_13_429885 359 24 . . . 10_1101-2021_02_13_429885 360 1 From from IN 10_1101-2021_02_13_429885 360 2 a a DT 10_1101-2021_02_13_429885 360 3 QC QC NNP 10_1101-2021_02_13_429885 360 4 perspective perspective NN 10_1101-2021_02_13_429885 360 5 , , , 10_1101-2021_02_13_429885 360 6 if if IN 10_1101-2021_02_13_429885 360 7 we -PRON- PRP 10_1101-2021_02_13_429885 360 8 solve solve VBP 10_1101-2021_02_13_429885 360 9 for for IN 10_1101-2021_02_13_429885 360 10 and and CC 10_1101-2021_02_13_429885 360 11 the the DT 10_1101-2021_02_13_429885 360 12 equations equation NNS 10_1101-2021_02_13_429885 360 13 , , , 10_1101-2021_02_13_429885 360 14 we -PRON- PRP 10_1101-2021_02_13_429885 360 15 can can MD 10_1101-2021_02_13_429885 360 16 get get VB 10_1101-2021_02_13_429885 360 17 as as IN 10_1101-2021_02_13_429885 360 18 which which WDT 10_1101-2021_02_13_429885 360 19 means mean VBZ 10_1101-2021_02_13_429885 360 20 that that IN 10_1101-2021_02_13_429885 360 21 if if IN 10_1101-2021_02_13_429885 360 22 we -PRON- PRP 10_1101-2021_02_13_429885 360 23 know know VBP 10_1101-2021_02_13_429885 360 24 tumour tumour NN 10_1101-2021_02_13_429885 360 25 purity purity NN 10_1101-2021_02_13_429885 360 26 and and CC 10_1101-2021_02_13_429885 360 27 CNA CNA NNP 10_1101-2021_02_13_429885 360 28 , , , 10_1101-2021_02_13_429885 360 29 we -PRON- PRP 10_1101-2021_02_13_429885 360 30 expect expect VBP 10_1101-2021_02_13_429885 360 31 a a DT 10_1101-2021_02_13_429885 360 32 peak peak NN 10_1101-2021_02_13_429885 360 33 at at IN 10_1101-2021_02_13_429885 360 34 VAF VAF NNP 10_1101-2021_02_13_429885 360 35 , , , 10_1101-2021_02_13_429885 360 36 for for IN 10_1101-2021_02_13_429885 360 37 a a DT 10_1101-2021_02_13_429885 360 38 given give VBN 10_1101-2021_02_13_429885 360 39 value value NN 10_1101-2021_02_13_429885 360 40 of of IN 10_1101-2021_02_13_429885 360 41 , , , 10_1101-2021_02_13_429885 360 42 in in IN 10_1101-2021_02_13_429885 360 
43 the the DT 10_1101-2021_02_13_429885 360 44 data data NN 10_1101-2021_02_13_429885 360 45 distribution distribution NN 10_1101-2021_02_13_429885 360 46 ( ( -LRB- 10_1101-2021_02_13_429885 360 47 ​Figure ​figure NN 10_1101-2021_02_13_429885 360 48 1a 1a CD 10_1101-2021_02_13_429885 360 49 and and CC 10_1101-2021_02_13_429885 360 50 ​1b ​1b NNP 10_1101-2021_02_13_429885 360 51 ​ ​ NNP 10_1101-2021_02_13_429885 360 52 ) ) -RRB- 10_1101-2021_02_13_429885 360 53 . . . 10_1101-2021_02_13_429885 361 1 For for IN 10_1101-2021_02_13_429885 361 2 instance instance NN 10_1101-2021_02_13_429885 361 3 , , , 10_1101-2021_02_13_429885 361 4 for for IN 10_1101-2021_02_13_429885 361 5 a a DT 10_1101-2021_02_13_429885 361 6 1:1 1:1 CD 10_1101-2021_02_13_429885 361 7 segment segment NN 10_1101-2021_02_13_429885 361 8 ( ( -LRB- 10_1101-2021_02_13_429885 361 9 ) ) -RRB- 10_1101-2021_02_13_429885 361 10 , , , 10_1101-2021_02_13_429885 361 11 the the DT 10_1101-2021_02_13_429885 361 12 expected expect VBN 10_1101-2021_02_13_429885 361 13 VAF VAF NNP 10_1101-2021_02_13_429885 361 14 for for IN 10_1101-2021_02_13_429885 361 15 a a DT 10_1101-2021_02_13_429885 361 16 heterozygous heterozygous JJ 10_1101-2021_02_13_429885 361 17 clonal clonal NN 10_1101-2021_02_13_429885 361 18 ( ( -LRB- 10_1101-2021_02_13_429885 361 19 ) ) -RRB- 10_1101-2021_02_13_429885 361 20 mutation mutation NN 10_1101-2021_02_13_429885 361 21 is be VBZ 10_1101-2021_02_13_429885 361 22 25 25 CD 10_1101-2021_02_13_429885 361 23 % % NN 10_1101-2021_02_13_429885 361 24 p p NN 10_1101-2021_02_13_429885 361 25 = = SYM 10_1101-2021_02_13_429885 361 26 2 2 CD 10_1101-2021_02_13_429885 361 27 m m NN 10_1101-2021_02_13_429885 361 28 = = SYM 10_1101-2021_02_13_429885 361 29 1 1 CD 10_1101-2021_02_13_429885 361 30 for for IN 10_1101-2021_02_13_429885 361 31 a a DT 10_1101-2021_02_13_429885 361 32 50%-purity 50%-purity CD 10_1101-2021_02_13_429885 361 33 tumour tumour NN 10_1101-2021_02_13_429885 361 34 , , , 
10_1101-2021_02_13_429885 361 35 and and CC 10_1101-2021_02_13_429885 361 36 50 50 CD 10_1101-2021_02_13_429885 361 37 % % NN 10_1101-2021_02_13_429885 361 38 for for IN 10_1101-2021_02_13_429885 361 39 a a DT 10_1101-2021_02_13_429885 361 40 100%-purity 100%-purity CD 10_1101-2021_02_13_429885 361 41 tumour tumour NN 10_1101-2021_02_13_429885 361 42 . . . 10_1101-2021_02_13_429885 362 1 Similarly similarly RB 10_1101-2021_02_13_429885 362 2 , , , 10_1101-2021_02_13_429885 362 3 for for IN 10_1101-2021_02_13_429885 362 4 a a DT 10_1101-2021_02_13_429885 362 5 2:2 2:2 CD 10_1101-2021_02_13_429885 362 6 genome genome NN 10_1101-2021_02_13_429885 362 7 ( ( -LRB- 10_1101-2021_02_13_429885 362 8 ) ) -RRB- 10_1101-2021_02_13_429885 362 9 of of IN 10_1101-2021_02_13_429885 362 10 a a DT 10_1101-2021_02_13_429885 362 11 tumour tumour NN 10_1101-2021_02_13_429885 362 12 with with IN 10_1101-2021_02_13_429885 362 13 75 75 CD 10_1101-2021_02_13_429885 362 14 % % NN 10_1101-2021_02_13_429885 362 15 purity purity NN 10_1101-2021_02_13_429885 362 16 , , , 10_1101-2021_02_13_429885 362 17 the the DT 10_1101-2021_02_13_429885 362 18 expected expect VBN 10_1101-2021_02_13_429885 362 19 VAF VAF NNP 10_1101-2021_02_13_429885 362 20 for for IN 10_1101-2021_02_13_429885 362 21 clonal clonal JJ 10_1101-2021_02_13_429885 362 22 mutations mutation NNS 10_1101-2021_02_13_429885 362 23 accruedp accruedp VBP 10_1101-2021_02_13_429885 362 24 = = SYM 10_1101-2021_02_13_429885 362 25 4 4 CD 10_1101-2021_02_13_429885 362 26 before before IN 10_1101-2021_02_13_429885 362 27 genome genome JJ 10_1101-2021_02_13_429885 362 28 doubling doubling NN 10_1101-2021_02_13_429885 362 29 and and CC 10_1101-2021_02_13_429885 362 30 therefore therefore RB 10_1101-2021_02_13_429885 362 31 visible visible JJ 10_1101-2021_02_13_429885 362 32 in in IN 10_1101-2021_02_13_429885 362 33 two two CD 10_1101-2021_02_13_429885 362 34 copies copy NNS 10_1101-2021_02_13_429885 362 35 ( ( -LRB- 10_1101-2021_02_13_429885 362 
36 ) ) -RRB- 10_1101-2021_02_13_429885 362 37 is be VBZ 10_1101-2021_02_13_429885 362 38 ~54 ~54 NFP 10_1101-2021_02_13_429885 362 39 % % NN 10_1101-2021_02_13_429885 362 40 , , , 10_1101-2021_02_13_429885 362 41 while while IN 10_1101-2021_02_13_429885 362 42 for for IN 10_1101-2021_02_13_429885 362 43 m m NN 10_1101-2021_02_13_429885 362 44 = = SYM 10_1101-2021_02_13_429885 362 45 2 2 CD 10_1101-2021_02_13_429885 362 46 those those DT 10_1101-2021_02_13_429885 362 47 accrued accrue VBN 10_1101-2021_02_13_429885 362 48 after after IN 10_1101-2021_02_13_429885 362 49 genome genome JJ 10_1101-2021_02_13_429885 362 50 doubling doubling NN 10_1101-2021_02_13_429885 362 51 , , , 10_1101-2021_02_13_429885 362 52 and and CC 10_1101-2021_02_13_429885 362 53 therefore therefore RB 10_1101-2021_02_13_429885 362 54 present present JJ 10_1101-2021_02_13_429885 362 55 in in IN 10_1101-2021_02_13_429885 362 56 single single JJ 10_1101-2021_02_13_429885 362 57 copy copy NN 10_1101-2021_02_13_429885 362 58 ( ( -LRB- 10_1101-2021_02_13_429885 362 59 ) ) -RRB- 10_1101-2021_02_13_429885 362 60 , , , 10_1101-2021_02_13_429885 362 61 we -PRON- PRP 10_1101-2021_02_13_429885 362 62 m m NN 10_1101-2021_02_13_429885 362 63 = = SYM 10_1101-2021_02_13_429885 362 64 1 1 CD 10_1101-2021_02_13_429885 362 65 expect expect VB 10_1101-2021_02_13_429885 362 66 a a DT 10_1101-2021_02_13_429885 362 67 ~21 ~21 NNP 10_1101-2021_02_13_429885 362 68 % % NN 10_1101-2021_02_13_429885 362 69 VAF VAF NNP 10_1101-2021_02_13_429885 362 70 ​(Dentro ​(Dentro NNP 10_1101-2021_02_13_429885 362 71 , , , 10_1101-2021_02_13_429885 362 72 Wedge Wedge NNP 10_1101-2021_02_13_429885 362 73 , , , 10_1101-2021_02_13_429885 362 74 and and CC 10_1101-2021_02_13_429885 362 75 Van Van NNP 10_1101-2021_02_13_429885 362 76 Loo Loo NNP 10_1101-2021_02_13_429885 362 77 2017)​. 2017)​. 
CD 10_1101-2021_02_13_429885 363 1 CNAqc cnaqc NN 10_1101-2021_02_13_429885 363 2 checks check VBZ 10_1101-2021_02_13_429885 363 3 the the DT 10_1101-2021_02_13_429885 363 4 data datum NNS 10_1101-2021_02_13_429885 363 5 for for IN 10_1101-2021_02_13_429885 363 6 peaks peak NNS 10_1101-2021_02_13_429885 363 7 at at IN 10_1101-2021_02_13_429885 363 8 these these DT 10_1101-2021_02_13_429885 363 9 VAFs vaf NNS 10_1101-2021_02_13_429885 363 10 , , , 10_1101-2021_02_13_429885 363 11 with with IN 10_1101-2021_02_13_429885 363 12 a a DT 10_1101-2021_02_13_429885 363 13 tolerance tolerance NN 10_1101-2021_02_13_429885 363 14 . . . 10_1101-2021_02_13_429885 364 1 From from IN 10_1101-2021_02_13_429885 364 2 the the DT 10_1101-2021_02_13_429885 364 3 distance distance NN 10_1101-2021_02_13_429885 364 4 between between IN 10_1101-2021_02_13_429885 364 5 the the DT 10_1101-2021_02_13_429885 364 6 theoretical theoretical JJ 10_1101-2021_02_13_429885 364 7 expectation expectation NN 10_1101-2021_02_13_429885 364 8 and and CC 10_1101-2021_02_13_429885 364 9 the the DT 10_1101-2021_02_13_429885 364 10 estimator estimator NN 10_1101-2021_02_13_429885 364 11 derived derive VBN 10_1101-2021_02_13_429885 364 12 from from IN 10_1101-2021_02_13_429885 364 13 data datum NNS 10_1101-2021_02_13_429885 364 14 , , , 10_1101-2021_02_13_429885 364 15 we -PRON- PRP 10_1101-2021_02_13_429885 364 16 obtain obtain VBP 10_1101-2021_02_13_429885 364 17 an an DT 10_1101-2021_02_13_429885 364 18 error error NN 10_1101-2021_02_13_429885 364 19 metric metric JJ 10_1101-2021_02_13_429885 364 20 for for IN 10_1101-2021_02_13_429885 364 21 the the DT 10_1101-2021_02_13_429885 364 22 calls call NNS 10_1101-2021_02_13_429885 364 23 . . . 
10_1101-2021_02_13_429885 365 1 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 365 2 first first RB 10_1101-2021_02_13_429885 365 3 performs perform VBZ 10_1101-2021_02_13_429885 365 4 peak peak NN 10_1101-2021_02_13_429885 365 5 detection detection NN 10_1101-2021_02_13_429885 365 6 from from IN 10_1101-2021_02_13_429885 365 7 the the DT 10_1101-2021_02_13_429885 365 8 input input NN 10_1101-2021_02_13_429885 365 9 VAF VAF NNP 10_1101-2021_02_13_429885 365 10 with with IN 10_1101-2021_02_13_429885 365 11 two two CD 10_1101-2021_02_13_429885 365 12 , , , 10_1101-2021_02_13_429885 365 13 separate separate JJ 10_1101-2021_02_13_429885 365 14 , , , 10_1101-2021_02_13_429885 365 15 methods method NNS 10_1101-2021_02_13_429885 365 16 : : : 10_1101-2021_02_13_429885 365 17 1 1 CD 10_1101-2021_02_13_429885 365 18 . . . 10_1101-2021_02_13_429885 366 1 Via via IN 10_1101-2021_02_13_429885 366 2 a a DT 10_1101-2021_02_13_429885 366 3 kernel kernel NN 10_1101-2021_02_13_429885 366 4 density density NN 10_1101-2021_02_13_429885 366 5 estimation estimation NN 10_1101-2021_02_13_429885 366 6 with with IN 10_1101-2021_02_13_429885 366 7 fixed fix VBN 10_1101-2021_02_13_429885 366 8 bandwidth bandwidth NN 10_1101-2021_02_13_429885 366 9 , , , 10_1101-2021_02_13_429885 366 10 which which WDT 10_1101-2021_02_13_429885 366 11 is be VBZ 10_1101-2021_02_13_429885 366 12 used use VBN 10_1101-2021_02_13_429885 366 13 to to TO 10_1101-2021_02_13_429885 366 14 determine determine VB 10_1101-2021_02_13_429885 366 15 a a DT 10_1101-2021_02_13_429885 366 16 smooth smooth JJ 10_1101-2021_02_13_429885 366 17 density density NN 10_1101-2021_02_13_429885 366 18 profile profile NN 10_1101-2021_02_13_429885 366 19 . . . 
10_1101-2021_02_13_429885 367 1 Peaks peak NNS 10_1101-2021_02_13_429885 367 2 are be VBP 10_1101-2021_02_13_429885 367 3 then then RB 10_1101-2021_02_13_429885 367 4 estimated estimate VBN 10_1101-2021_02_13_429885 367 5 from from IN 10_1101-2021_02_13_429885 367 6 the the DT 10_1101-2021_02_13_429885 367 7 discretized discretized JJ 10_1101-2021_02_13_429885 367 8 smooth smooth NN 10_1101-2021_02_13_429885 367 9 , , , 10_1101-2021_02_13_429885 367 10 using use VBG 10_1101-2021_02_13_429885 367 11 specialised specialise VBN 10_1101-2021_02_13_429885 367 12 R r NN 10_1101-2021_02_13_429885 367 13 packages package NNS 10_1101-2021_02_13_429885 367 14 for for IN 10_1101-2021_02_13_429885 367 15 peak peak NN 10_1101-2021_02_13_429885 367 16 - - HYPH 10_1101-2021_02_13_429885 367 17 detection detection NN 10_1101-2021_02_13_429885 367 18 and and CC 10_1101-2021_02_13_429885 367 19 removing remove VBG 10_1101-2021_02_13_429885 367 20 peaks peak NNS 10_1101-2021_02_13_429885 367 21 with with IN 10_1101-2021_02_13_429885 367 22 density density NN 10_1101-2021_02_13_429885 367 23 below below IN 10_1101-2021_02_13_429885 367 24 a a DT 10_1101-2021_02_13_429885 367 25 parameterized parameterize VBN 10_1101-2021_02_13_429885 367 26 cutoff cutoff NN 10_1101-2021_02_13_429885 367 27 . . . 10_1101-2021_02_13_429885 368 1 2 2 LS 10_1101-2021_02_13_429885 368 2 . . . 10_1101-2021_02_13_429885 369 1 Via Via NNP 10_1101-2021_02_13_429885 369 2 Binomial Binomial NNP 10_1101-2021_02_13_429885 369 3 mixture mixture NN 10_1101-2021_02_13_429885 369 4 from from IN 10_1101-2021_02_13_429885 369 5 the the DT 10_1101-2021_02_13_429885 369 6 BMix BMix NNP 10_1101-2021_02_13_429885 369 7 ​(Caravagna ​(Caravagna NNP 10_1101-2021_02_13_429885 369 8 et et NNP 10_1101-2021_02_13_429885 369 9 al al NNP 10_1101-2021_02_13_429885 369 10 . . . 
10_1101-2021_02_13_429885 370 1 2020 2020 LS 10_1101-2021_02_13_429885 370 2 ) ) -RRB- 10_1101-2021_02_13_429885 370 3 package package NN 10_1101-2021_02_13_429885 370 4 ( ( -LRB- 10_1101-2021_02_13_429885 370 5 ​https://caravagn.github.io ​https://caravagn.github.io NNP 10_1101-2021_02_13_429885 370 6 / / SYM 10_1101-2021_02_13_429885 370 7 BMix/​ BMix/​ NNP 10_1101-2021_02_13_429885 370 8 ) ) -RRB- 10_1101-2021_02_13_429885 370 9 , , , 10_1101-2021_02_13_429885 370 10 a a DT 10_1101-2021_02_13_429885 370 11 peak peak NN 10_1101-2021_02_13_429885 370 12 is be VBZ 10_1101-2021_02_13_429885 370 13 associated associate VBN 10_1101-2021_02_13_429885 370 14 with with IN 10_1101-2021_02_13_429885 370 15 each each DT 10_1101-2021_02_13_429885 370 16 Binomial binomial JJ 10_1101-2021_02_13_429885 370 17 probability probability NN 10_1101-2021_02_13_429885 370 18 , , , 10_1101-2021_02_13_429885 370 19 for for IN 10_1101-2021_02_13_429885 370 20 all all DT 10_1101-2021_02_13_429885 370 21 mixture mixture NN 10_1101-2021_02_13_429885 370 22 components component NNS 10_1101-2021_02_13_429885 370 23 . . . 10_1101-2021_02_13_429885 371 1 Peaks peak NNS 10_1101-2021_02_13_429885 371 2 are be VBP 10_1101-2021_02_13_429885 371 3 matched match VBN 10_1101-2021_02_13_429885 371 4 to to IN 10_1101-2021_02_13_429885 371 5 the the DT 10_1101-2021_02_13_429885 371 6 expected expect VBN 10_1101-2021_02_13_429885 371 7 theoretical theoretical JJ 10_1101-2021_02_13_429885 371 8 values value NNS 10_1101-2021_02_13_429885 371 9 based base VBN 10_1101-2021_02_13_429885 371 10 on on IN 10_1101-2021_02_13_429885 371 11 their -PRON- PRP$ 10_1101-2021_02_13_429885 371 12 euclidean euclidean JJ 10_1101-2021_02_13_429885 371 13 distance distance NN 10_1101-2021_02_13_429885 371 14 . . . 
10_1101-2021_02_13_429885 372 1 A a DT 10_1101-2021_02_13_429885 372 2 theoretical theoretical JJ 10_1101-2021_02_13_429885 372 3 peak peak NN 10_1101-2021_02_13_429885 372 4 can can MD 10_1101-2021_02_13_429885 372 5 be be VB 10_1101-2021_02_13_429885 372 6 matched match VBN 10_1101-2021_02_13_429885 372 7 to to IN 10_1101-2021_02_13_429885 372 8 the the DT 10_1101-2021_02_13_429885 372 9 closest close JJS 10_1101-2021_02_13_429885 372 10 peak peak NN 10_1101-2021_02_13_429885 372 11 in in IN 10_1101-2021_02_13_429885 372 12 the the DT 10_1101-2021_02_13_429885 372 13 data datum NNS 10_1101-2021_02_13_429885 372 14 , , , 10_1101-2021_02_13_429885 372 15 or or CC 10_1101-2021_02_13_429885 372 16 the the DT 10_1101-2021_02_13_429885 372 17 one one NN 10_1101-2021_02_13_429885 372 18 to to IN 10_1101-2021_02_13_429885 372 19 the the DT 10_1101-2021_02_13_429885 372 20 most most RBS 10_1101-2021_02_13_429885 372 21 right right JJ 10_1101-2021_02_13_429885 372 22 side side NN 10_1101-2021_02_13_429885 372 23 of of IN 10_1101-2021_02_13_429885 372 24 the the DT 10_1101-2021_02_13_429885 372 25 frequency frequency NN 10_1101-2021_02_13_429885 372 26 spectrum spectrum NN 10_1101-2021_02_13_429885 372 27 . . . 10_1101-2021_02_13_429885 373 1 This this DT 10_1101-2021_02_13_429885 373 2 latter latter JJ 10_1101-2021_02_13_429885 373 3 strategy strategy NN 10_1101-2021_02_13_429885 373 4 works work VBZ 10_1101-2021_02_13_429885 373 5 only only RB 10_1101-2021_02_13_429885 373 6 if if IN 10_1101-2021_02_13_429885 373 7 there there EX 10_1101-2021_02_13_429885 373 8 are be VBP 10_1101-2021_02_13_429885 373 9 no no DT 10_1101-2021_02_13_429885 373 10 miscalled miscalled JJ 10_1101-2021_02_13_429885 373 11 CNAs cna NNS 10_1101-2021_02_13_429885 373 12 . . . 
10_1101-2021_02_13_429885 374 1 The the DT 10_1101-2021_02_13_429885 374 2 first first JJ 10_1101-2021_02_13_429885 374 3 strategy strategy NN 10_1101-2021_02_13_429885 374 4 ( ( -LRB- 10_1101-2021_02_13_429885 374 5 closest close JJS 10_1101-2021_02_13_429885 374 6 match match NN 10_1101-2021_02_13_429885 374 7 ) ) -RRB- 10_1101-2021_02_13_429885 374 8 , , , 10_1101-2021_02_13_429885 374 9 is be VBZ 10_1101-2021_02_13_429885 374 10 the the DT 10_1101-2021_02_13_429885 374 11 default default NN 10_1101-2021_02_13_429885 374 12 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 374 13 choice choice NN 10_1101-2021_02_13_429885 374 14 . . . 
10_1101-2021_02_13_429885 376 22 https://www.codecogs.com/eqnedit.php?latex=m#0 https://www.codecogs.com/eqnedit.php?latex=m#0 NNP 10_1101-2021_02_13_429885 376 23 https://www.codecogs.com/eqnedit.php?latex=%5Cpi#0 https://www.codecogs.com/eqnedit.php?latex=%5Cpi#0 NNP 10_1101-2021_02_13_429885 376 24 https://www.codecogs.com/eqnedit.php?latex=v#0 https://www.codecogs.com/eqnedit.php?latex=v#0 NNP 10_1101-2021_02_13_429885 376 25 https://www.codecogs.com/eqnedit.php?latex=v%20%3D%20%5Cdfrac%7Bv%5B(p-2)%5Cpi%20%2B%202%5D%7D%7Bm%5Cpi%7D%20#0 https://www.codecogs.com/eqnedit.php?latex=v%20%3D%20%5Cdfrac%7Bv%5B(p-2)%5Cpi%20%2B%202%5D%7D%7Bm%5Cpi%7D%20#0 NNP 10_1101-2021_02_13_429885 376 26 https://www.codecogs.com/eqnedit.php?latex=v#0 https://www.codecogs.com/eqnedit.php?latex=v#0 NNP 10_1101-2021_02_13_429885 376 27 https://www.codecogs.com/eqnedit.php?latex=m#0 https://www.codecogs.com/eqnedit.php?latex=m#0 NNP 10_1101-2021_02_13_429885 376 28 https://paperpile.com/c/rqVmzs/Uxwc https://paperpile.com/c/rqvmzs/uxwc JJ 10_1101-2021_02_13_429885 376 29 https://www.codecogs.com/eqnedit.php?latex=%5Cepsilon%3E0#0 https://www.codecogs.com/eqnedit.php?latex=%5cepsilon%3e0#0 NN 10_1101-2021_02_13_429885 376 30 https://paperpile.com/c/rqVmzs/chqB https://paperpile.com/c/rqVmzs/chqB NNP 10_1101-2021_02_13_429885 376 31 https://caravagn.github.io/BMix/ https://caravagn.github.io/bmix/ JJ 
10_1101-2021_02_13_429885 378 1 For for IN 10_1101-2021_02_13_429885 378 2 every every DT 10_1101-2021_02_13_429885 378 3 peak peak NN 10_1101-2021_02_13_429885 378 4 a a DT 10_1101-2021_02_13_429885 378 5 QC QC NNP 10_1101-2021_02_13_429885 378 6 value value NN 10_1101-2021_02_13_429885 378 7 ( ( -LRB- 10_1101-2021_02_13_429885 378 8 PASS pas NNS 10_1101-2021_02_13_429885 378 9 or or CC 10_1101-2021_02_13_429885 378 10 FAIL FAIL NNP 10_1101-2021_02_13_429885 378 11 ) ) -RRB- 10_1101-2021_02_13_429885 378 12 is be VBZ 10_1101-2021_02_13_429885 378 13 determined determine VBN 10_1101-2021_02_13_429885 378 14 based base VBN 10_1101-2021_02_13_429885 378 15 on on IN 10_1101-2021_02_13_429885 378 16 some some DT 10_1101-2021_02_13_429885 378 17 tolerance tolerance NN 10_1101-2021_02_13_429885 378 18 . . . 
10_1101-2021_02_13_429885 379 1 The the DT 10_1101-2021_02_13_429885 379 2 overall overall JJ 10_1101-2021_02_13_429885 379 3 QC QC NNP 10_1101-2021_02_13_429885 379 4 status status NN 10_1101-2021_02_13_429885 379 5 of of IN 10_1101-2021_02_13_429885 379 6 copy copy NN 10_1101-2021_02_13_429885 379 7 states state NNS 10_1101-2021_02_13_429885 379 8 with with IN 10_1101-2021_02_13_429885 379 9 multiple multiple JJ 10_1101-2021_02_13_429885 379 10 peaks peak NNS 10_1101-2021_02_13_429885 379 11 is be VBZ 10_1101-2021_02_13_429885 379 12 the the DT 10_1101-2021_02_13_429885 379 13 QC QC NNP 10_1101-2021_02_13_429885 379 14 of of IN 10_1101-2021_02_13_429885 379 15 the the DT 10_1101-2021_02_13_429885 379 16 peakε peakε NN 10_1101-2021_02_13_429885 379 17 > > XX 10_1101-2021_02_13_429885 379 18 0 0 CD 10_1101-2021_02_13_429885 379 19 with with IN 10_1101-2021_02_13_429885 379 20 most most JJS 10_1101-2021_02_13_429885 379 21 mutations mutation NNS 10_1101-2021_02_13_429885 379 22 underneath underneath RB 10_1101-2021_02_13_429885 379 23 . . . 
10_1101-2021_02_13_429885 380 1 The the DT 10_1101-2021_02_13_429885 380 2 overall overall JJ 10_1101-2021_02_13_429885 380 3 QC QC NNP 10_1101-2021_02_13_429885 380 4 status status NN 10_1101-2021_02_13_429885 380 5 for for IN 10_1101-2021_02_13_429885 380 6 a a DT 10_1101-2021_02_13_429885 380 7 sample sample NN 10_1101-2021_02_13_429885 380 8 with with IN 10_1101-2021_02_13_429885 380 9 many many JJ 10_1101-2021_02_13_429885 380 10 copy copy NN 10_1101-2021_02_13_429885 380 11 states state NNS 10_1101-2021_02_13_429885 380 12 is be VBZ 10_1101-2021_02_13_429885 380 13 determined determine VBN 10_1101-2021_02_13_429885 380 14 by by IN 10_1101-2021_02_13_429885 380 15 summing sum VBG 10_1101-2021_02_13_429885 380 16 up up RP 10_1101-2021_02_13_429885 380 17 the the DT 10_1101-2021_02_13_429885 380 18 QC QC NNP 10_1101-2021_02_13_429885 380 19 status status NN 10_1101-2021_02_13_429885 380 20 of of IN 10_1101-2021_02_13_429885 380 21 individual individual JJ 10_1101-2021_02_13_429885 380 22 copy copy NN 10_1101-2021_02_13_429885 380 23 states state NNS 10_1101-2021_02_13_429885 380 24 , , , 10_1101-2021_02_13_429885 380 25 and and CC 10_1101-2021_02_13_429885 380 26 weighting weight VBG 10_1101-2021_02_13_429885 380 27 them -PRON- PRP 10_1101-2021_02_13_429885 380 28 by by IN 10_1101-2021_02_13_429885 380 29 the the DT 10_1101-2021_02_13_429885 380 30 number number NN 10_1101-2021_02_13_429885 380 31 of of IN 10_1101-2021_02_13_429885 380 32 mutations mutation NNS 10_1101-2021_02_13_429885 380 33 associated associate VBN 10_1101-2021_02_13_429885 380 34 ( ( -LRB- 10_1101-2021_02_13_429885 380 35 majority majority NNP 10_1101-2021_02_13_429885 380 36 rule rule NN 10_1101-2021_02_13_429885 380 37 ) ) -RRB- 10_1101-2021_02_13_429885 380 38 . . . 
10_1101-2021_02_13_429885 381 1 CCF ccf NN 10_1101-2021_02_13_429885 381 2 estimation estimation NN 10_1101-2021_02_13_429885 381 3 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 381 4 can can MD 10_1101-2021_02_13_429885 381 5 compute compute VB 10_1101-2021_02_13_429885 381 6 CCFs ccf NNS 10_1101-2021_02_13_429885 381 7 in in IN 10_1101-2021_02_13_429885 381 8 two two CD 10_1101-2021_02_13_429885 381 9 ways way NNS 10_1101-2021_02_13_429885 381 10 . . . 10_1101-2021_02_13_429885 382 1 One one CD 10_1101-2021_02_13_429885 382 2 of of IN 10_1101-2021_02_13_429885 382 3 the the DT 10_1101-2021_02_13_429885 382 4 two two CD 10_1101-2021_02_13_429885 382 5 uses use VBZ 10_1101-2021_02_13_429885 382 6 the the DT 10_1101-2021_02_13_429885 382 7 idea idea NN 10_1101-2021_02_13_429885 382 8 of of IN 10_1101-2021_02_13_429885 382 9 the the DT 10_1101-2021_02_13_429885 382 10 mixture mixture NN 10_1101-2021_02_13_429885 382 11 highlighted highlight VBN 10_1101-2021_02_13_429885 382 12 in in IN 10_1101-2021_02_13_429885 382 13 Figure figure NN 10_1101-2021_02_13_429885 382 14 1c 1c CD 10_1101-2021_02_13_429885 382 16 , , , 10_1101-2021_02_13_429885 382 17 the the DT 10_1101-2021_02_13_429885 382 18 other other JJ 10_1101-2021_02_13_429885 382 19 is be VBZ 10_1101-2021_02_13_429885 382 20 simpler simple JJR 10_1101-2021_02_13_429885 382 21 and and CC 10_1101-2021_02_13_429885 382 22 works work VBZ 10_1101-2021_02_13_429885 382 23 better well RBR 10_1101-2021_02_13_429885 382 24 when when WRB 10_1101-2021_02_13_429885 382 25 data datum NNS 10_1101-2021_02_13_429885 382 26 resolution resolution NN 10_1101-2021_02_13_429885 382 27 is be VBZ 10_1101-2021_02_13_429885 382 28 low low JJ 10_1101-2021_02_13_429885 382 29 , , , 10_1101-2021_02_13_429885 382 30 and and CC 10_1101-2021_02_13_429885 382 31 the the DT 10_1101-2021_02_13_429885 382 32 entropy entropy JJ 10_1101-2021_02_13_429885 382 33 of of IN 10_1101-2021_02_13_429885 382 34 the the 
DT 10_1101-2021_02_13_429885 382 35 mixture mixture NN 10_1101-2021_02_13_429885 382 36 model model NN 10_1101-2021_02_13_429885 382 37 would would MD 10_1101-2021_02_13_429885 382 38 leave leave VB 10_1101-2021_02_13_429885 382 39 too too RB 10_1101-2021_02_13_429885 382 40 many many JJ 10_1101-2021_02_13_429885 382 41 mutations mutation NNS 10_1101-2021_02_13_429885 382 42 unassigned unassigne VBN 10_1101-2021_02_13_429885 382 43 . . . 10_1101-2021_02_13_429885 383 1 For for IN 10_1101-2021_02_13_429885 383 2 the the DT 10_1101-2021_02_13_429885 383 3 mixture mixture NN 10_1101-2021_02_13_429885 383 4 approach approach NN 10_1101-2021_02_13_429885 383 5 , , , 10_1101-2021_02_13_429885 383 6 we -PRON- PRP 10_1101-2021_02_13_429885 383 7 build build VBP 10_1101-2021_02_13_429885 383 8 a a DT 10_1101-2021_02_13_429885 383 9 2-components 2-components CD 10_1101-2021_02_13_429885 383 10 Binomial Binomial NNP 10_1101-2021_02_13_429885 383 11 mixture mixture NN 10_1101-2021_02_13_429885 383 12 from from IN 10_1101-2021_02_13_429885 383 13 the the DT 10_1101-2021_02_13_429885 383 14 theoretical theoretical JJ 10_1101-2021_02_13_429885 383 15 expectations expectation NNS 10_1101-2021_02_13_429885 383 16 and and CC 10_1101-2021_02_13_429885 383 17 the the DT 10_1101-2021_02_13_429885 383 18 data datum NNS 10_1101-2021_02_13_429885 383 19 . . . 10_1101-2021_02_13_429885 384 1 This this DT 10_1101-2021_02_13_429885 384 2 implicitly implicitly RB 10_1101-2021_02_13_429885 384 3 assumes assume VBZ 10_1101-2021_02_13_429885 384 4 that that IN 10_1101-2021_02_13_429885 384 5 peaks peak NNS 10_1101-2021_02_13_429885 384 6 have have VBP 10_1101-2021_02_13_429885 384 7 been be VBN 10_1101-2021_02_13_429885 384 8 QCed QCed NNP 10_1101-2021_02_13_429885 384 9 first first RB 10_1101-2021_02_13_429885 384 10 . . . 
10_1101-2021_02_13_429885 385 1 We -PRON- PRP 10_1101-2021_02_13_429885 385 2 constraint constraint VBP 10_1101-2021_02_13_429885 385 3 the the DT 10_1101-2021_02_13_429885 385 4 success success NN 10_1101-2021_02_13_429885 385 5 parameters parameter NNS 10_1101-2021_02_13_429885 385 6 to to TO 10_1101-2021_02_13_429885 385 7 match match VB 10_1101-2021_02_13_429885 385 8 the the DT 10_1101-2021_02_13_429885 385 9 expected expect VBN 10_1101-2021_02_13_429885 385 10 VAF VAF NNP 10_1101-2021_02_13_429885 385 11 , , , 10_1101-2021_02_13_429885 385 12 and and CC 10_1101-2021_02_13_429885 385 13 use use VB 10_1101-2021_02_13_429885 385 14 the the DT 10_1101-2021_02_13_429885 385 15 proportion proportion NN 10_1101-2021_02_13_429885 385 16 of of IN 10_1101-2021_02_13_429885 385 17 mutations mutation NNS 10_1101-2021_02_13_429885 385 18 that that WDT 10_1101-2021_02_13_429885 385 19 appear appear VBP 10_1101-2021_02_13_429885 385 20 underneath underneath IN 10_1101-2021_02_13_429885 385 21 a a DT 10_1101-2021_02_13_429885 385 22 peak peak NN 10_1101-2021_02_13_429885 385 23 as as IN 10_1101-2021_02_13_429885 385 24 mixing mix VBG 10_1101-2021_02_13_429885 385 25 proportions proportion NNS 10_1101-2021_02_13_429885 385 26 . . . 
10_1101-2021_02_13_429885 386 1 π π LS 10_1101-2021_02_13_429885 386 2 Then then RB 10_1101-2021_02_13_429885 386 3 , , , 10_1101-2021_02_13_429885 386 4 from from IN 10_1101-2021_02_13_429885 386 5 the the DT 10_1101-2021_02_13_429885 386 6 latent latent NN 10_1101-2021_02_13_429885 386 7 variables variable NNS 10_1101-2021_02_13_429885 386 8 of of IN 10_1101-2021_02_13_429885 386 9 the the DT 10_1101-2021_02_13_429885 386 10 model model NN 10_1101-2021_02_13_429885 386 11 we -PRON- PRP 10_1101-2021_02_13_429885 386 12 compute compute VBP 10_1101-2021_02_13_429885 386 13 the the DT 10_1101-2021_02_13_429885 386 14 probability probability NN 10_1101-2021_02_13_429885 386 15 of of IN 10_1101-2021_02_13_429885 386 16 assigning assign VBG 10_1101-2021_02_13_429885 386 17 a a DT 10_1101-2021_02_13_429885 386 18 z z NN 10_1101-2021_02_13_429885 386 19 mutation mutation NN 10_1101-2021_02_13_429885 386 20 with with IN 10_1101-2021_02_13_429885 386 21 VAF VAF NNP 10_1101-2021_02_13_429885 386 22 to to TO 10_1101-2021_02_13_429885 386 23 cluster cluster VB 10_1101-2021_02_13_429885 386 24 , , , 10_1101-2021_02_13_429885 386 25 xn xn NNP 10_1101-2021_02_13_429885 386 26 c c NNP 10_1101-2021_02_13_429885 386 27 . . . 
10_1101-2021_02_13_429885 387 1 ( ( -LRB- 10_1101-2021_02_13_429885 387 2 z z NNP 10_1101-2021_02_13_429885 387 3 | | NNP 10_1101-2021_02_13_429885 387 4 θ θ NNP 10_1101-2021_02_13_429885 387 5 , , , 10_1101-2021_02_13_429885 387 6 ) ) -RRB- 10_1101-2021_02_13_429885 387 7 p p NN 10_1101-2021_02_13_429885 387 8 n n NN 10_1101-2021_02_13_429885 387 9 , , , 10_1101-2021_02_13_429885 387 10 k k NNP 10_1101-2021_02_13_429885 387 11 = = SYM 10_1101-2021_02_13_429885 387 12 c c NNP 10_1101-2021_02_13_429885 387 13 π π NN 10_1101-2021_02_13_429885 387 14 From from IN 10_1101-2021_02_13_429885 387 15 this this DT 10_1101-2021_02_13_429885 387 16 information information NN 10_1101-2021_02_13_429885 387 17 we -PRON- PRP 10_1101-2021_02_13_429885 387 18 obtain obtain VBP 10_1101-2021_02_13_429885 387 19 the the DT 10_1101-2021_02_13_429885 387 20 entropy entropy JJ 10_1101-2021_02_13_429885 387 21 of of IN 10_1101-2021_02_13_429885 387 22 , , , 10_1101-2021_02_13_429885 387 23 which which WDT 10_1101-2021_02_13_429885 387 24 is be VBZ 10_1101-2021_02_13_429885 387 25 low low JJ 10_1101-2021_02_13_429885 387 26 for for IN 10_1101-2021_02_13_429885 387 27 values value NNS 10_1101-2021_02_13_429885 387 28 that that WDT 10_1101-2021_02_13_429885 387 29 are be VBP 10_1101-2021_02_13_429885 387 30 ( ( -LRB- 10_1101-2021_02_13_429885 387 31 z)H z)H NNP 10_1101-2021_02_13_429885 387 32 z z NN 10_1101-2021_02_13_429885 387 33 assignable assignable JJ 10_1101-2021_02_13_429885 387 34 to to IN 10_1101-2021_02_13_429885 387 35 only only RB 10_1101-2021_02_13_429885 387 36 one one CD 10_1101-2021_02_13_429885 387 37 cluster cluster NN 10_1101-2021_02_13_429885 387 38 . . . 
10_1101-2021_02_13_429885 388 1 Recall recall NN 10_1101-2021_02_13_429885 388 2 in in IN 10_1101-2021_02_13_429885 388 3 this this DT 10_1101-2021_02_13_429885 388 4 respect respect NN 10_1101-2021_02_13_429885 388 5 that that IN 10_1101-2021_02_13_429885 388 6 the the DT 10_1101-2021_02_13_429885 388 7 maximum maximum JJ 10_1101-2021_02_13_429885 388 8 entropy entropy JJ 10_1101-2021_02_13_429885 388 9 distribution distribution NN 10_1101-2021_02_13_429885 388 10 is be VBZ 10_1101-2021_02_13_429885 388 11 the the DT 10_1101-2021_02_13_429885 388 12 uniform uniform JJ 10_1101-2021_02_13_429885 388 13 one one CD 10_1101-2021_02_13_429885 388 14 , , , 10_1101-2021_02_13_429885 388 15 which which WDT 10_1101-2021_02_13_429885 388 16 is be VBZ 10_1101-2021_02_13_429885 388 17 when when WRB 10_1101-2021_02_13_429885 388 18 a a DT 10_1101-2021_02_13_429885 388 19 mutation mutation NN 10_1101-2021_02_13_429885 388 20 can can MD 10_1101-2021_02_13_429885 388 21 be be VB 10_1101-2021_02_13_429885 388 22 equally equally RB 10_1101-2021_02_13_429885 388 23 likely likely JJ 10_1101-2021_02_13_429885 388 24 in in IN 10_1101-2021_02_13_429885 388 25 1 1 CD 10_1101-2021_02_13_429885 388 26 or or CC 10_1101-2021_02_13_429885 388 27 2 2 CD 10_1101-2021_02_13_429885 388 28 copies copy NNS 10_1101-2021_02_13_429885 388 29 , , , 10_1101-2021_02_13_429885 388 30 based base VBN 10_1101-2021_02_13_429885 388 31 on on IN 10_1101-2021_02_13_429885 388 32 VAF VAF NNP 10_1101-2021_02_13_429885 388 33 . . . 
10_1101-2021_02_13_429885 389 1 We -PRON- PRP 10_1101-2021_02_13_429885 389 2 use use VBP 10_1101-2021_02_13_429885 389 3 a a DT 10_1101-2021_02_13_429885 389 4 simple simple JJ 10_1101-2021_02_13_429885 389 5 peak peak NN 10_1101-2021_02_13_429885 389 6 detection detection NN 10_1101-2021_02_13_429885 389 7 heuristic heuristic JJ 10_1101-2021_02_13_429885 389 8 to to TO 10_1101-2021_02_13_429885 389 9 find find VB 10_1101-2021_02_13_429885 389 10 2 2 CD 10_1101-2021_02_13_429885 389 11 points point NNS 10_1101-2021_02_13_429885 389 12 of of IN 10_1101-2021_02_13_429885 389 13 changes change NNS 10_1101-2021_02_13_429885 389 14 in in IN 10_1101-2021_02_13_429885 389 15 ; ; : 10_1101-2021_02_13_429885 389 16 in in IN 10_1101-2021_02_13_429885 389 17 ( ( -LRB- 10_1101-2021_02_13_429885 389 18 z)H z)H NNS 10_1101-2021_02_13_429885 389 19 between between IN 10_1101-2021_02_13_429885 389 20 those those DT 10_1101-2021_02_13_429885 389 21 values value NNS 10_1101-2021_02_13_429885 389 22 we -PRON- PRP 10_1101-2021_02_13_429885 389 23 can can MD 10_1101-2021_02_13_429885 389 24 not not RB 10_1101-2021_02_13_429885 389 25 reliably reliably RB 10_1101-2021_02_13_429885 389 26 assess assess VB 10_1101-2021_02_13_429885 389 27 , , , 10_1101-2021_02_13_429885 389 28 i.e. i.e. FW 10_1101-2021_02_13_429885 390 1 assess assess VB 10_1101-2021_02_13_429885 390 2 if if IN 10_1101-2021_02_13_429885 390 3 the the DT 10_1101-2021_02_13_429885 390 4 mutation mutation NN 10_1101-2021_02_13_429885 390 5 is be VBZ 10_1101-2021_02_13_429885 390 6 in in IN 10_1101-2021_02_13_429885 390 7 m m NNP 10_1101-2021_02_13_429885 390 8 single single JJ 10_1101-2021_02_13_429885 390 9 or or CC 10_1101-2021_02_13_429885 390 10 double double JJ 10_1101-2021_02_13_429885 390 11 copy copy NN 10_1101-2021_02_13_429885 390 12 . . . 
10_1101-2021_02_13_429885 391 1 For for IN 10_1101-2021_02_13_429885 391 2 these these DT 10_1101-2021_02_13_429885 391 3 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 391 4 leaves leave VBZ 10_1101-2021_02_13_429885 391 5 the the DT 10_1101-2021_02_13_429885 391 6 CCF CCF NNP 10_1101-2021_02_13_429885 391 7 value value NN 10_1101-2021_02_13_429885 391 8 as as IN 10_1101-2021_02_13_429885 391 9 NA NA NNP 10_1101-2021_02_13_429885 391 10 . . . 10_1101-2021_02_13_429885 392 1 The the DT 10_1101-2021_02_13_429885 392 2 alternative alternative JJ 10_1101-2021_02_13_429885 392 3 approach approach NN 10_1101-2021_02_13_429885 392 4 uses use VBZ 10_1101-2021_02_13_429885 392 5 a a DT 10_1101-2021_02_13_429885 392 6 simpler simple JJR 10_1101-2021_02_13_429885 392 7 idea idea NN 10_1101-2021_02_13_429885 392 8 , , , 10_1101-2021_02_13_429885 392 9 still still RB 10_1101-2021_02_13_429885 392 10 working work VBG 10_1101-2021_02_13_429885 392 11 on on IN 10_1101-2021_02_13_429885 392 12 the the DT 10_1101-2021_02_13_429885 392 13 expected expect VBN 10_1101-2021_02_13_429885 392 14 theoretical theoretical JJ 10_1101-2021_02_13_429885 392 15 VAF VAF NNP 10_1101-2021_02_13_429885 392 16 . . . 
10_1101-2021_02_13_429885 393 1 Here here RB 10_1101-2021_02_13_429885 393 2 instead instead RB 10_1101-2021_02_13_429885 393 3 of of IN 10_1101-2021_02_13_429885 393 4 fitting fit VBG 10_1101-2021_02_13_429885 393 5 a a DT 10_1101-2021_02_13_429885 393 6 mixture mixture NN 10_1101-2021_02_13_429885 393 7 we -PRON- PRP 10_1101-2021_02_13_429885 393 8 determine determine VBP 10_1101-2021_02_13_429885 393 9 the the DT 10_1101-2021_02_13_429885 393 10 midpoint midpoint NN 10_1101-2021_02_13_429885 393 11 , , , 10_1101-2021_02_13_429885 393 12 between between IN 10_1101-2021_02_13_429885 393 13 the the DT 10_1101-2021_02_13_429885 393 14 two two CD 10_1101-2021_02_13_429885 393 15 o o XX 10_1101-2021_02_13_429885 393 16 expected expect VBN 10_1101-2021_02_13_429885 393 17 theoretical theoretical JJ 10_1101-2021_02_13_429885 393 18 VAF VAF NNP 10_1101-2021_02_13_429885 393 19 peaks peak NNS 10_1101-2021_02_13_429885 393 20 . . . 10_1101-2021_02_13_429885 394 1 The the DT 10_1101-2021_02_13_429885 394 2 midpoint midpoint NN 10_1101-2021_02_13_429885 394 3 is be VBZ 10_1101-2021_02_13_429885 394 4 computed compute VBN 10_1101-2021_02_13_429885 394 5 by by IN 10_1101-2021_02_13_429885 394 6 weighting weight VBG 10_1101-2021_02_13_429885 394 7 each each DT 10_1101-2021_02_13_429885 394 8 of of IN 10_1101-2021_02_13_429885 394 9 the the DT 10_1101-2021_02_13_429885 394 10 two two CD 10_1101-2021_02_13_429885 394 11 peaks peak NNS 10_1101-2021_02_13_429885 394 12 proportionally proportionally RB 10_1101-2021_02_13_429885 394 13 to to IN 10_1101-2021_02_13_429885 394 14 the the DT 10_1101-2021_02_13_429885 394 15 number number NN 10_1101-2021_02_13_429885 394 16 of of IN 10_1101-2021_02_13_429885 394 17 mutations mutation NNS 10_1101-2021_02_13_429885 394 18 that that WDT 10_1101-2021_02_13_429885 394 19 appear appear VBP 10_1101-2021_02_13_429885 394 20 underneath underneath IN 10_1101-2021_02_13_429885 394 21 each each DT 10_1101-2021_02_13_429885 394 22 peak peak NN 
10_1101-2021_02_13_429885 394 23 . . . 10_1101-2021_02_13_429885 395 1 The the DT 10_1101-2021_02_13_429885 395 2 midpoint midpoint NN 10_1101-2021_02_13_429885 395 3 is be VBZ 10_1101-2021_02_13_429885 395 4 a a DT 10_1101-2021_02_13_429885 395 5 cut cut NN 10_1101-2021_02_13_429885 395 6 : : : 10_1101-2021_02_13_429885 395 7 values value NNS 10_1101-2021_02_13_429885 395 8 below below RB 10_1101-2021_02_13_429885 395 9 are be VBP 10_1101-2021_02_13_429885 395 10 in in IN 10_1101-2021_02_13_429885 395 11 single single JJ 10_1101-2021_02_13_429885 395 12 copy copy NN 10_1101-2021_02_13_429885 395 13 , , , 10_1101-2021_02_13_429885 395 14 values value NNS 10_1101-2021_02_13_429885 395 15 above above RB 10_1101-2021_02_13_429885 395 16 in in IN 10_1101-2021_02_13_429885 395 17 two two CD 10_1101-2021_02_13_429885 395 18 . . . 10_1101-2021_02_13_429885 396 1 This this DT 10_1101-2021_02_13_429885 396 2 o o NN 10_1101-2021_02_13_429885 396 3 procedure procedure NN 10_1101-2021_02_13_429885 396 4 requires require VBZ 10_1101-2021_02_13_429885 396 5 data datum NNS 10_1101-2021_02_13_429885 396 6 with with IN 10_1101-2021_02_13_429885 396 7 good good JJ 10_1101-2021_02_13_429885 396 8 sequencing sequencing NN 10_1101-2021_02_13_429885 396 9 coverage coverage NN 10_1101-2021_02_13_429885 396 10 , , , 10_1101-2021_02_13_429885 396 11 and and CC 10_1101-2021_02_13_429885 396 12 a a DT 10_1101-2021_02_13_429885 396 13 good good JJ 10_1101-2021_02_13_429885 396 14 general general JJ 10_1101-2021_02_13_429885 396 15 quality quality NN 10_1101-2021_02_13_429885 396 16 . . . 
10_1101-2021_02_13_429885 397 1 When when WRB 10_1101-2021_02_13_429885 397 2 mutation mutation NN 10_1101-2021_02_13_429885 397 3 multiplicities multiplicity NNS 10_1101-2021_02_13_429885 397 4 have have VBP 10_1101-2021_02_13_429885 397 5 been be VBN 10_1101-2021_02_13_429885 397 6 determined determine VBN 10_1101-2021_02_13_429885 397 7 , , , 10_1101-2021_02_13_429885 397 8 CCF CCF NNP 10_1101-2021_02_13_429885 397 9 computation computation NN 10_1101-2021_02_13_429885 397 10 is be VBZ 10_1101-2021_02_13_429885 397 11 trivial trivial JJ 10_1101-2021_02_13_429885 397 12 , , , 10_1101-2021_02_13_429885 397 13 and and CC 10_1101-2021_02_13_429885 397 14 follows follow VBZ 10_1101-2021_02_13_429885 397 15 the the DT 10_1101-2021_02_13_429885 397 16 formula formula NN 10_1101-2021_02_13_429885 397 17 presented present VBN 10_1101-2021_02_13_429885 397 18 in in IN 10_1101-2021_02_13_429885 397 19 the the DT 10_1101-2021_02_13_429885 397 20 Main Main NNP 10_1101-2021_02_13_429885 397 21 Text Text NNP 10_1101-2021_02_13_429885 397 22 . . . 
10_1101-2021_02_13_429885 398 1 A a DT 10_1101-2021_02_13_429885 398 2 QC qc NN 10_1101-2021_02_13_429885 398 3 PASS PASS NNP 10_1101-2021_02_13_429885 398 4 status status NN 10_1101-2021_02_13_429885 398 5 is be VBZ 10_1101-2021_02_13_429885 398 6 assigned assign VBN 10_1101-2021_02_13_429885 398 7 to to IN 10_1101-2021_02_13_429885 398 8 the the DT 
10_1101-2021_02_13_429885 401 1 CCF ccf NN 10_1101-2021_02_13_429885 401 2 values value NNS 10_1101-2021_02_13_429885 401 3 for for IN 10_1101-2021_02_13_429885 401 4 a a DT 10_1101-2021_02_13_429885 401 5 copy copy NN 10_1101-2021_02_13_429885 401 6 state state NN 10_1101-2021_02_13_429885 401 7 , , , 10_1101-2021_02_13_429885 401 8 if if IN 10_1101-2021_02_13_429885 401 9 less less JJR 10_1101-2021_02_13_429885 401 10 than than IN 10_1101-2021_02_13_429885 401 11 10 10 CD 10_1101-2021_02_13_429885 401 12 % % NN 10_1101-2021_02_13_429885 401 13 ( ( -LRB- 10_1101-2021_02_13_429885 401 14 or or CC 10_1101-2021_02_13_429885 401 15 any any DT 10_1101-2021_02_13_429885 401 16 custom custom NN 10_1101-2021_02_13_429885 401 17 threshold threshold NN 10_1101-2021_02_13_429885 401 18 ) ) -RRB- 10_1101-2021_02_13_429885 401 19 are be VBP 10_1101-2021_02_13_429885 401 20 unassigned unassigned JJ 10_1101-2021_02_13_429885 401 21 . . . 
10_1101-2021_02_13_429885 402 1 The the DT 10_1101-2021_02_13_429885 402 2 overall overall JJ 10_1101-2021_02_13_429885 402 3 sample sample NN 10_1101-2021_02_13_429885 402 4 is be VBZ 10_1101-2021_02_13_429885 402 5 given give VBN 10_1101-2021_02_13_429885 402 6 a a DT 10_1101-2021_02_13_429885 402 7 QC QC NNP 10_1101-2021_02_13_429885 402 8 status status NN 10_1101-2021_02_13_429885 402 9 based base VBN 10_1101-2021_02_13_429885 402 10 on on IN 10_1101-2021_02_13_429885 402 11 a a DT 10_1101-2021_02_13_429885 402 12 majority majority NN 10_1101-2021_02_13_429885 402 13 policy policy NN 10_1101-2021_02_13_429885 402 14 . . . 10_1101-2021_02_13_429885 403 1 Genome genome JJ 10_1101-2021_02_13_429885 403 2 fragmentation fragmentation NN 10_1101-2021_02_13_429885 403 3 Some some DT 10_1101-2021_02_13_429885 403 4 recently recently RB 10_1101-2021_02_13_429885 403 5 identified identify VBN 10_1101-2021_02_13_429885 403 6 patterns pattern NNS 10_1101-2021_02_13_429885 403 7 of of IN 10_1101-2021_02_13_429885 403 8 somatic somatic JJ 10_1101-2021_02_13_429885 403 9 CNA cna NN 10_1101-2021_02_13_429885 403 10 changes change NNS 10_1101-2021_02_13_429885 403 11 can can MD 10_1101-2021_02_13_429885 403 12 be be VB 10_1101-2021_02_13_429885 403 13 attributed attribute VBN 10_1101-2021_02_13_429885 403 14 to to IN 10_1101-2021_02_13_429885 403 15 the the DT 10_1101-2021_02_13_429885 403 16 presence presence NN 10_1101-2021_02_13_429885 403 17 of of IN 10_1101-2021_02_13_429885 403 18 highly highly RB 10_1101-2021_02_13_429885 403 19 fragmented fragmented JJ 10_1101-2021_02_13_429885 403 20 tumour tumour NN 10_1101-2021_02_13_429885 403 21 genomes genome NNS 10_1101-2021_02_13_429885 403 22 , , , 10_1101-2021_02_13_429885 403 23 termed term VBD 10_1101-2021_02_13_429885 403 24 chromothripsis chromothripsis NN 10_1101-2021_02_13_429885 403 25 and and CC 10_1101-2021_02_13_429885 403 26 chromoplexy chromoplexy NN 10_1101-2021_02_13_429885 403 27 , , , 10_1101-2021_02_13_429885 
403 28 or or CC 10_1101-2021_02_13_429885 403 29 localised localise VBN 10_1101-2021_02_13_429885 403 30 hypermutation hypermutation NN 10_1101-2021_02_13_429885 403 31 patterns pattern NNS 10_1101-2021_02_13_429885 403 32 , , , 10_1101-2021_02_13_429885 403 33 termed term VBD 10_1101-2021_02_13_429885 403 34 kataegis kataegis NNP 10_1101-2021_02_13_429885 403 35 ​(Cortés ​(Cortés NNPS 10_1101-2021_02_13_429885 403 36 - - HYPH 10_1101-2021_02_13_429885 403 37 Ciriano Ciriano NNP 10_1101-2021_02_13_429885 403 38 et et NNP 10_1101-2021_02_13_429885 403 39 al al NNP 10_1101-2021_02_13_429885 403 40 . . . 10_1101-2021_02_13_429885 404 1 2020)​. 2020)​. CD 10_1101-2021_02_13_429885 405 1 While while IN 10_1101-2021_02_13_429885 405 2 these these DT 10_1101-2021_02_13_429885 405 3 can can MD 10_1101-2021_02_13_429885 405 4 be be VB 10_1101-2021_02_13_429885 405 5 identified identify VBN 10_1101-2021_02_13_429885 405 6 using use VBG 10_1101-2021_02_13_429885 405 7 dedicated dedicated JJ 10_1101-2021_02_13_429885 405 8 bioinformatics bioinformatic NNS 10_1101-2021_02_13_429885 405 9 tools tool NNS 10_1101-2021_02_13_429885 405 10 , , , 10_1101-2021_02_13_429885 405 11 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 405 12 offers offer VBZ 10_1101-2021_02_13_429885 405 13 a a DT 10_1101-2021_02_13_429885 405 14 simple simple JJ 10_1101-2021_02_13_429885 405 15 statistical statistical JJ 10_1101-2021_02_13_429885 405 16 test test NN 10_1101-2021_02_13_429885 405 17 to to TO 10_1101-2021_02_13_429885 405 18 detect detect VB 10_1101-2021_02_13_429885 405 19 the the DT 10_1101-2021_02_13_429885 405 20 presence presence NN 10_1101-2021_02_13_429885 405 21 of of IN 10_1101-2021_02_13_429885 405 22 over over IN 10_1101-2021_02_13_429885 405 23 - - HYPH 10_1101-2021_02_13_429885 405 24 fragmentation fragmentation NN 10_1101-2021_02_13_429885 405 25 in in IN 10_1101-2021_02_13_429885 405 26 a a DT 10_1101-2021_02_13_429885 405 27 chromosome chromosome NN 10_1101-2021_02_13_429885 405 28 
arm arm NN 10_1101-2021_02_13_429885 405 29 , , , 10_1101-2021_02_13_429885 405 30 a a DT 10_1101-2021_02_13_429885 405 31 prerequisite prerequisite NN 10_1101-2021_02_13_429885 405 32 that that WDT 10_1101-2021_02_13_429885 405 33 could could MD 10_1101-2021_02_13_429885 405 34 point point VB 10_1101-2021_02_13_429885 405 35 to to IN 10_1101-2021_02_13_429885 405 36 the the DT 10_1101-2021_02_13_429885 405 37 presence presence NN 10_1101-2021_02_13_429885 405 38 of of IN 10_1101-2021_02_13_429885 405 39 such such JJ 10_1101-2021_02_13_429885 405 40 patterns pattern NNS 10_1101-2021_02_13_429885 405 41 . . . 10_1101-2021_02_13_429885 406 1 The the DT 10_1101-2021_02_13_429885 406 2 test test NN 10_1101-2021_02_13_429885 406 3 works work VBZ 10_1101-2021_02_13_429885 406 4 at at IN 10_1101-2021_02_13_429885 406 5 the the DT 10_1101-2021_02_13_429885 406 6 level level NN 10_1101-2021_02_13_429885 406 7 of of IN 10_1101-2021_02_13_429885 406 8 each each DT 10_1101-2021_02_13_429885 406 9 chromosome chromosome NN 10_1101-2021_02_13_429885 406 10 arm arm NN 10_1101-2021_02_13_429885 406 11 ( ( -LRB- 10_1101-2021_02_13_429885 406 12 1p 1p NNP 10_1101-2021_02_13_429885 406 13 , , , 10_1101-2021_02_13_429885 406 14 1q 1q CD 10_1101-2021_02_13_429885 406 15 , , , 10_1101-2021_02_13_429885 406 16 2p 2p NNP 10_1101-2021_02_13_429885 406 17 , , , 10_1101-2021_02_13_429885 406 18 2q 2q CD 10_1101-2021_02_13_429885 406 19 , , , 10_1101-2021_02_13_429885 406 20 etc etc FW 10_1101-2021_02_13_429885 406 21 . . . 
10_1101-2021_02_13_429885 407 1 ) ) -RRB- 10_1101-2021_02_13_429885 407 2 , , , 10_1101-2021_02_13_429885 407 3 and and CC 10_1101-2021_02_13_429885 407 4 uses use VBZ 10_1101-2021_02_13_429885 407 5 the the DT 10_1101-2021_02_13_429885 407 6 length length NN 10_1101-2021_02_13_429885 407 7 of of IN 10_1101-2021_02_13_429885 407 8 each each DT 10_1101-2021_02_13_429885 407 9 input input NN 10_1101-2021_02_13_429885 407 10 CNA CNA NNP 10_1101-2021_02_13_429885 407 11 segment segment NN 10_1101-2021_02_13_429885 407 12 to to TO 10_1101-2021_02_13_429885 407 13 assign assign VB 10_1101-2021_02_13_429885 407 14 a a DT 10_1101-2021_02_13_429885 407 15 “ " `` 10_1101-2021_02_13_429885 407 16 long long JJ 10_1101-2021_02_13_429885 407 17 segment segment NN 10_1101-2021_02_13_429885 407 18 ” " '' 10_1101-2021_02_13_429885 407 19 or or CC 10_1101-2021_02_13_429885 407 20 “ " `` 10_1101-2021_02_13_429885 407 21 short short JJ 10_1101-2021_02_13_429885 407 22 segment segment NN 10_1101-2021_02_13_429885 407 23 ” " '' 10_1101-2021_02_13_429885 407 24 status status NN 10_1101-2021_02_13_429885 407 25 . . . 10_1101-2021_02_13_429885 408 1 This this DT 10_1101-2021_02_13_429885 408 2 is be VBZ 10_1101-2021_02_13_429885 408 3 determined determine VBN 10_1101-2021_02_13_429885 408 4 by by IN 10_1101-2021_02_13_429885 408 5 a a DT 10_1101-2021_02_13_429885 408 6 cut cut NN 10_1101-2021_02_13_429885 408 7 parameter parameter NN 10_1101-2021_02_13_429885 408 8 that that WDT 10_1101-2021_02_13_429885 408 9 is be VBZ 10_1101-2021_02_13_429885 408 10 set set VBN 10_1101-2021_02_13_429885 408 11 , , , 10_1101-2021_02_13_429885 408 12 by by IN 10_1101-2021_02_13_429885 408 13 default default NN 10_1101-2021_02_13_429885 408 14 , , , 10_1101-2021_02_13_429885 408 15 to to IN 10_1101-2021_02_13_429885 408 16 20 20 CD 10_1101-2021_02_13_429885 408 17 % % NN 10_1101-2021_02_13_429885 408 18 ( ( -LRB- 10_1101-2021_02_13_429885 408 19 i.e. i.e. 
FW 10_1101-2021_02_13_429885 408 20 , , , 10_1101-2021_02_13_429885 408 21 ) ) -RRB- 10_1101-2021_02_13_429885 408 22 . . . 10_1101-2021_02_13_429885 409 1 μ μ LS 10_1101-2021_02_13_429885 409 2 .2μ .2μ NFP 10_1101-2021_02_13_429885 409 3 = = NFP 10_1101-2021_02_13_429885 409 4 0 0 NFP 10_1101-2021_02_13_429885 409 5 Then then RB 10_1101-2021_02_13_429885 409 6 , , , 10_1101-2021_02_13_429885 409 7 a a DT 10_1101-2021_02_13_429885 409 8 null null NN 10_1101-2021_02_13_429885 409 9 hypothesis hypothesis NN 10_1101-2021_02_13_429885 409 10 is be VBZ 10_1101-2021_02_13_429885 409 11 used use VBN 10_1101-2021_02_13_429885 409 12 to to TO 10_1101-2021_02_13_429885 409 13 compute compute VB 10_1101-2021_02_13_429885 409 14 a a DT 10_1101-2021_02_13_429885 409 15 p p NN 10_1101-2021_02_13_429885 409 16 - - HYPH 10_1101-2021_02_13_429885 409 17 value value NN 10_1101-2021_02_13_429885 409 18 . . . 10_1101-2021_02_13_429885 410 1 That that DT 10_1101-2021_02_13_429885 410 2 is be VBZ 10_1101-2021_02_13_429885 410 3 defined define VBN 10_1101-2021_02_13_429885 410 4 using use VBG 10_1101-2021_02_13_429885 410 5 a a DT 10_1101-2021_02_13_429885 410 6 Binomial Binomial NNP 10_1101-2021_02_13_429885 410 7 test test NN 10_1101-2021_02_13_429885 410 8 based base VBN 10_1101-2021_02_13_429885 410 9 on on IN 10_1101-2021_02_13_429885 410 10 , , , 10_1101-2021_02_13_429885 410 11 the the DT 10_1101-2021_02_13_429885 410 12 number number NN 10_1101-2021_02_13_429885 410 13 of of IN 10_1101-2021_02_13_429885 410 14 trials trial NNS 10_1101-2021_02_13_429885 410 15 given give VBN 10_1101-2021_02_13_429885 410 16 by by IN 10_1101-2021_02_13_429885 410 17 the the DT 10_1101-2021_02_13_429885 410 18 total total JJ 10_1101-2021_02_13_429885 410 19 segment segment NN 10_1101-2021_02_13_429885 410 20 counts count NNS 10_1101-2021_02_13_429885 410 21 in in IN 10_1101-2021_02_13_429885 410 22 the the DT 10_1101-2021_02_13_429885 410 23 arm arm NN 10_1101-2021_02_13_429885 410 24 , , , 
10_1101-2021_02_13_429885 410 25 and and CC 10_1101-2021_02_13_429885 410 26 k k NNP 10_1101-2021_02_13_429885 410 27 the the DT 10_1101-2021_02_13_429885 410 28 observed observed JJ 10_1101-2021_02_13_429885 410 29 number number NN 10_1101-2021_02_13_429885 410 30 of of IN 10_1101-2021_02_13_429885 410 31 short short JJ 10_1101-2021_02_13_429885 410 32 segments segment NNS 10_1101-2021_02_13_429885 410 33 . . . 10_1101-2021_02_13_429885 411 1 The the DT 10_1101-2021_02_13_429885 411 2 Binomial Binomial NNP 10_1101-2021_02_13_429885 411 3 distribution distribution NN 10_1101-2021_02_13_429885 411 4 for for IN 10_1101-2021_02_13_429885 411 5 is be VBZ 10_1101-2021_02_13_429885 411 6 defined define VBN 10_1101-2021_02_13_429885 411 7 s s POS 10_1101-2021_02_13_429885 411 8 H0 h0 NN 10_1101-2021_02_13_429885 411 9 by by IN 10_1101-2021_02_13_429885 411 10 , , , 10_1101-2021_02_13_429885 411 11 and and CC 10_1101-2021_02_13_429885 411 12 the the DT 10_1101-2021_02_13_429885 411 13 null null NN 10_1101-2021_02_13_429885 411 14 is be VBZ 10_1101-2021_02_13_429885 411 15 the the DT 10_1101-2021_02_13_429885 411 16 probability probability NN 10_1101-2021_02_13_429885 411 17 of of IN 10_1101-2021_02_13_429885 411 18 observing observe VBG 10_1101-2021_02_13_429885 411 19 at at IN 10_1101-2021_02_13_429885 411 20 least least JJS 10_1101-2021_02_13_429885 411 21 short short JJ 10_1101-2021_02_13_429885 411 22 segments segment NNS 10_1101-2021_02_13_429885 411 23 , , , 10_1101-2021_02_13_429885 411 24 a a DT 10_1101-2021_02_13_429885 411 25 one one CD 10_1101-2021_02_13_429885 411 26 - - HYPH 10_1101-2021_02_13_429885 411 27 tailed tail VBN 10_1101-2021_02_13_429885 411 28 μ μ NNP 10_1101-2021_02_13_429885 411 29 s s NNP 10_1101-2021_02_13_429885 411 30 test test NN 10_1101-2021_02_13_429885 411 31 for for IN 10_1101-2021_02_13_429885 411 32 whether whether IN 10_1101-2021_02_13_429885 411 33 the the DT 10_1101-2021_02_13_429885 411 34 observations observation NNS 
10_1101-2021_02_13_429885 411 35 are be VBP 10_1101-2021_02_13_429885 411 36 biased biased JJ 10_1101-2021_02_13_429885 411 37 towards towards IN 10_1101-2021_02_13_429885 411 38 shorter short JJR 10_1101-2021_02_13_429885 411 39 segments segment NNS 10_1101-2021_02_13_429885 411 40 . . . 10_1101-2021_02_13_429885 412 1 The the DT 10_1101-2021_02_13_429885 412 2 p p NN 10_1101-2021_02_13_429885 412 3 - - HYPH 10_1101-2021_02_13_429885 412 4 value value NN 10_1101-2021_02_13_429885 412 5 is be VBZ 10_1101-2021_02_13_429885 412 6 adjusted adjust VBN 10_1101-2021_02_13_429885 412 7 for for IN 10_1101-2021_02_13_429885 412 8 family family NN 10_1101-2021_02_13_429885 412 9 - - HYPH 10_1101-2021_02_13_429885 412 10 wise wise JJ 10_1101-2021_02_13_429885 412 11 error error NN 10_1101-2021_02_13_429885 412 12 rate rate NN 10_1101-2021_02_13_429885 412 13 by by IN 10_1101-2021_02_13_429885 412 14 Bonferroni Bonferroni NNP 10_1101-2021_02_13_429885 412 15 , , , 10_1101-2021_02_13_429885 412 16 dividing divide VBG 10_1101-2021_02_13_429885 412 17 the the DT 10_1101-2021_02_13_429885 412 18 desired desire VBN 10_1101-2021_02_13_429885 412 19 -value -value NN 10_1101-2021_02_13_429885 412 20 by by IN 10_1101-2021_02_13_429885 412 21 the the DT 10_1101-2021_02_13_429885 412 22 α α NNP 10_1101-2021_02_13_429885 412 23 number number NN 10_1101-2021_02_13_429885 412 24 of of IN 10_1101-2021_02_13_429885 412 25 tests test NNS 10_1101-2021_02_13_429885 412 26 . . . 
10_1101-2021_02_13_429885 413 1 This this DT 10_1101-2021_02_13_429885 413 2 test test NN 10_1101-2021_02_13_429885 413 3 is be VBZ 10_1101-2021_02_13_429885 413 4 applied apply VBN 10_1101-2021_02_13_429885 413 5 to to IN 10_1101-2021_02_13_429885 413 6 a a DT 10_1101-2021_02_13_429885 413 7 subset subset NN 10_1101-2021_02_13_429885 413 8 of of IN 10_1101-2021_02_13_429885 413 9 chromosome chromosome NN 10_1101-2021_02_13_429885 413 10 arms arm NNS 10_1101-2021_02_13_429885 413 11 with with IN 10_1101-2021_02_13_429885 413 12 a a DT 10_1101-2021_02_13_429885 413 13 minimum minimum JJ 10_1101-2021_02_13_429885 413 14 number number NN 10_1101-2021_02_13_429885 413 15 of of IN 10_1101-2021_02_13_429885 413 16 segments segment NNS 10_1101-2021_02_13_429885 413 17 , , , 10_1101-2021_02_13_429885 413 18 and and CC 10_1101-2021_02_13_429885 413 19 that that IN 10_1101-2021_02_13_429885 413 20 “ " `` 10_1101-2021_02_13_429885 413 21 jump jump VB 10_1101-2021_02_13_429885 413 22 ” " '' 10_1101-2021_02_13_429885 413 23 in in IN 10_1101-2021_02_13_429885 413 24 ploidy ploidy NN 10_1101-2021_02_13_429885 413 25 by by IN 10_1101-2021_02_13_429885 413 26 a a DT 10_1101-2021_02_13_429885 413 27 minimum minimum JJ 10_1101-2021_02_13_429885 413 28 amount amount NN 10_1101-2021_02_13_429885 413 29 ( ( -LRB- 10_1101-2021_02_13_429885 413 30 empirical empirical JJ 10_1101-2021_02_13_429885 413 31 default default NN 10_1101-2021_02_13_429885 413 32 values value NNS 10_1101-2021_02_13_429885 413 33 estimated estimate VBN 10_1101-2021_02_13_429885 413 34 from from IN 10_1101-2021_02_13_429885 413 35 trial trial NN 10_1101-2021_02_13_429885 413 36 data datum NNS 10_1101-2021_02_13_429885 413 37 ) ) -RRB- 10_1101-2021_02_13_429885 413 38 . . . 
10_1101-2021_02_13_429885 414 1 The the DT 10_1101-2021_02_13_429885 414 2 arm arm NN 10_1101-2021_02_13_429885 414 3 - - HYPH 10_1101-2021_02_13_429885 414 4 level level NN 10_1101-2021_02_13_429885 414 5 jump jump NN 10_1101-2021_02_13_429885 414 6 is be VBZ 10_1101-2021_02_13_429885 414 7 determined determine VBN 10_1101-2021_02_13_429885 414 8 as as IN 10_1101-2021_02_13_429885 414 9 the the DT 10_1101-2021_02_13_429885 414 10 sum sum NN 10_1101-2021_02_13_429885 414 11 of of IN 10_1101-2021_02_13_429885 414 12 the the DT 10_1101-2021_02_13_429885 414 13 difference difference NN 10_1101-2021_02_13_429885 414 14 between between IN 10_1101-2021_02_13_429885 414 15 the the DT 10_1101-2021_02_13_429885 414 16 ploidy ploidy NN 10_1101-2021_02_13_429885 414 17 of of IN 10_1101-2021_02_13_429885 414 18 two two CD 10_1101-2021_02_13_429885 414 19 consecutive consecutive JJ 10_1101-2021_02_13_429885 414 20 DNA dna NN 10_1101-2021_02_13_429885 414 21 segments segment NNS 10_1101-2021_02_13_429885 414 22 . . . 
10_1101-2021_02_13_429885 415 1 These these DT 10_1101-2021_02_13_429885 415 2 covariates covariate NNS 10_1101-2021_02_13_429885 415 3 are be VBP 10_1101-2021_02_13_429885 415 4 similar similar JJ 10_1101-2021_02_13_429885 415 5 to to IN 10_1101-2021_02_13_429885 415 6 those those DT 10_1101-2021_02_13_429885 415 7 used use VBN 10_1101-2021_02_13_429885 415 8 to to TO 10_1101-2021_02_13_429885 415 9 infer infer VB 10_1101-2021_02_13_429885 415 10 CNA cna NN 10_1101-2021_02_13_429885 415 11 signatures signature NNS 10_1101-2021_02_13_429885 415 12 from from IN 10_1101-2021_02_13_429885 415 13 single single JJ 10_1101-2021_02_13_429885 415 14 - - HYPH 10_1101-2021_02_13_429885 415 15 cell cell NN 10_1101-2021_02_13_429885 415 16 low low JJ 10_1101-2021_02_13_429885 415 17 - - HYPH 10_1101-2021_02_13_429885 415 18 pass pass NN 10_1101-2021_02_13_429885 415 19 WGS WGS NNP 10_1101-2021_02_13_429885 415 20 ​(Macintyre ​(Macintyre NNP 10_1101-2021_02_13_429885 415 21 et et FW 10_1101-2021_02_13_429885 415 22 al al NNP 10_1101-2021_02_13_429885 415 23 . . . 10_1101-2021_02_13_429885 416 1 2018 2018 CD 10_1101-2021_02_13_429885 416 2 ) ) -RRB- 10_1101-2021_02_13_429885 416 3 ​. ​. ADD 10_1101-2021_02_13_429885 417 1 Other other JJ 10_1101-2021_02_13_429885 417 2 features feature NNS 10_1101-2021_02_13_429885 417 3 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 417 4 contains contain VBZ 10_1101-2021_02_13_429885 417 5 multiple multiple JJ 10_1101-2021_02_13_429885 417 6 functions function NNS 10_1101-2021_02_13_429885 417 7 to to TO 10_1101-2021_02_13_429885 417 8 subset subset VB 10_1101-2021_02_13_429885 417 9 the the DT 10_1101-2021_02_13_429885 417 10 data datum NNS 10_1101-2021_02_13_429885 417 11 ( ( -LRB- 10_1101-2021_02_13_429885 417 12 i.e. i.e. 
FW 10_1101-2021_02_13_429885 417 13 , , , 10_1101-2021_02_13_429885 417 14 select select JJ 10_1101-2021_02_13_429885 417 15 mutations mutation NNS 10_1101-2021_02_13_429885 417 16 that that WDT 10_1101-2021_02_13_429885 417 17 map map VBP 10_1101-2021_02_13_429885 417 18 only only RB 10_1101-2021_02_13_429885 417 19 to to IN 10_1101-2021_02_13_429885 417 20 certain certain JJ 10_1101-2021_02_13_429885 417 21 copy copy NN 10_1101-2021_02_13_429885 417 22 states state NNS 10_1101-2021_02_13_429885 417 23 , , , 10_1101-2021_02_13_429885 417 24 subset subset VBP 10_1101-2021_02_13_429885 417 25 CNAs cna NNS 10_1101-2021_02_13_429885 417 26 with with IN 10_1101-2021_02_13_429885 417 27 a a DT 10_1101-2021_02_13_429885 417 28 total total JJ 10_1101-2021_02_13_429885 417 29 ploidy ploidy NN 10_1101-2021_02_13_429885 417 30 , , , 10_1101-2021_02_13_429885 417 31 etc etc FW 10_1101-2021_02_13_429885 417 32 . . . 10_1101-2021_02_13_429885 418 1 ) ) -RRB- 10_1101-2021_02_13_429885 418 2 , , , 10_1101-2021_02_13_429885 418 3 visualise visualise VB 10_1101-2021_02_13_429885 418 4 the the DT 10_1101-2021_02_13_429885 418 5 data datum NNS 10_1101-2021_02_13_429885 418 6 ( ( -LRB- 10_1101-2021_02_13_429885 418 7 i.e. i.e. FW 10_1101-2021_02_13_429885 418 8 , , , 10_1101-2021_02_13_429885 418 9 plot plot NN 10_1101-2021_02_13_429885 418 10 mutational mutational JJ 10_1101-2021_02_13_429885 418 11 burden burden NN 10_1101-2021_02_13_429885 418 12 by by IN 10_1101-2021_02_13_429885 418 13 tumour tumour NN 10_1101-2021_02_13_429885 418 14 genome genome NN 10_1101-2021_02_13_429885 418 15 ) ) -RRB- 10_1101-2021_02_13_429885 418 16 or or CC 10_1101-2021_02_13_429885 418 17 smooth smooth VB 10_1101-2021_02_13_429885 418 18 the the DT 10_1101-2021_02_13_429885 418 19 input input NN 10_1101-2021_02_13_429885 418 20 CNA cna NN 10_1101-2021_02_13_429885 418 21 segments segment NNS 10_1101-2021_02_13_429885 418 22 . . . 
10_1101-2021_02_13_429885 419 1 Smoothing smooth VBG 10_1101-2021_02_13_429885 419 2 is be VBZ 10_1101-2021_02_13_429885 419 3 an an DT 10_1101-2021_02_13_429885 419 4 operation operation NN 10_1101-2021_02_13_429885 419 5 that that WDT 10_1101-2021_02_13_429885 419 6 can can MD 10_1101-2021_02_13_429885 419 7 be be VB 10_1101-2021_02_13_429885 419 8 carried carry VBN 10_1101-2021_02_13_429885 419 9 out out RP 10_1101-2021_02_13_429885 419 10 before before IN 10_1101-2021_02_13_429885 419 11 testing test VBG 10_1101-2021_02_13_429885 419 12 for for IN 10_1101-2021_02_13_429885 419 13 over over IN 10_1101-2021_02_13_429885 419 14 - - HYPH 10_1101-2021_02_13_429885 419 15 fragmentation fragmentation NN 10_1101-2021_02_13_429885 419 16 . . . 10_1101-2021_02_13_429885 420 1 In in IN 10_1101-2021_02_13_429885 420 2 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 420 3 , , , 10_1101-2021_02_13_429885 420 4 by by IN 10_1101-2021_02_13_429885 420 5 smoothing smooth VBG 10_1101-2021_02_13_429885 420 6 we -PRON- PRP 10_1101-2021_02_13_429885 420 7 obtain obtain VBP 10_1101-2021_02_13_429885 420 8 that that IN 10_1101-2021_02_13_429885 420 9 two two CD 10_1101-2021_02_13_429885 420 10 contiguous contiguous JJ 10_1101-2021_02_13_429885 420 11 segments segment NNS 10_1101-2021_02_13_429885 420 12 are be VBP 10_1101-2021_02_13_429885 420 13 merged merge VBN 10_1101-2021_02_13_429885 420 14 if if IN 10_1101-2021_02_13_429885 420 15 they -PRON- PRP 10_1101-2021_02_13_429885 420 16 have have VBP 10_1101-2021_02_13_429885 420 17 exactly exactly RB 10_1101-2021_02_13_429885 420 18 the the DT 10_1101-2021_02_13_429885 420 19 same same JJ 10_1101-2021_02_13_429885 420 20 ploidy ploidy NN 10_1101-2021_02_13_429885 420 21 profile profile NN 10_1101-2021_02_13_429885 420 22 ( ( -LRB- 10_1101-2021_02_13_429885 420 23 i.e. i.e. 
FW 10_1101-2021_02_13_429885 421 1 same same JJ 10_1101-2021_02_13_429885 421 2 numbers number NNS 10_1101-2021_02_13_429885 421 3 for for IN 10_1101-2021_02_13_429885 421 4 the the DT 10_1101-2021_02_13_429885 421 5 major major JJ 10_1101-2021_02_13_429885 421 6 and and CC 10_1101-2021_02_13_429885 421 7 minor minor JJ 10_1101-2021_02_13_429885 424 1 alleles alleles NNP 10_1101-2021_02_13_429885 424 2 ) ) -RRB- 10_1101-2021_02_13_429885 424 3 , , , 10_1101-2021_02_13_429885 424 4 and and CC 10_1101-2021_02_13_429885 424 5 if if IN 10_1101-2021_02_13_429885 424 6 they -PRON- PRP 10_1101-2021_02_13_429885 424 7 are be VBP 10_1101-2021_02_13_429885 424 8 a a DT 10_1101-2021_02_13_429885 424 9 maximum maximum JJ 10_1101-2021_02_13_429885 424 10 distance distance NN 10_1101-2021_02_13_429885 424 11 apart apart RB 10_1101-2021_02_13_429885 424 12 ( ( -LRB- 10_1101-2021_02_13_429885 424 13 e.g. e.g. RB 10_1101-2021_02_13_429885 425 1 1 1 CD 10_1101-2021_02_13_429885 425 2 megabase megabase NN 10_1101-2021_02_13_429885 425 3 ) ) -RRB- 10_1101-2021_02_13_429885 425 4 . . . 
10_1101-2021_02_13_429885 426 1 This this DT 10_1101-2021_02_13_429885 426 2 operation operation NN 10_1101-2021_02_13_429885 426 3 does do VBZ 10_1101-2021_02_13_429885 426 4 not not RB 10_1101-2021_02_13_429885 426 5 affect affect VB 10_1101-2021_02_13_429885 426 6 the the DT 10_1101-2021_02_13_429885 426 7 ploidy ploidy NN 10_1101-2021_02_13_429885 426 8 profile profile NN 10_1101-2021_02_13_429885 426 9 of of IN 10_1101-2021_02_13_429885 426 10 the the DT 10_1101-2021_02_13_429885 426 11 calls call NNS 10_1101-2021_02_13_429885 426 12 , , , 10_1101-2021_02_13_429885 426 13 but but CC 10_1101-2021_02_13_429885 426 14 reduces reduce VBZ 10_1101-2021_02_13_429885 426 15 the the DT 10_1101-2021_02_13_429885 426 16 amount amount NN 10_1101-2021_02_13_429885 426 17 of of IN 10_1101-2021_02_13_429885 426 18 breakpoints breakpoint NNS 10_1101-2021_02_13_429885 426 19 that that WDT 10_1101-2021_02_13_429885 426 20 would would MD 10_1101-2021_02_13_429885 426 21 inflate inflate VB 10_1101-2021_02_13_429885 426 22 the the DT 10_1101-2021_02_13_429885 426 23 p p NN 10_1101-2021_02_13_429885 426 24 - - HYPH 10_1101-2021_02_13_429885 426 25 value value NN 10_1101-2021_02_13_429885 426 26 of of IN 10_1101-2021_02_13_429885 426 27 the the DT 10_1101-2021_02_13_429885 426 28 Binomial Binomial NNP 10_1101-2021_02_13_429885 426 29 over over IN 10_1101-2021_02_13_429885 426 30 - - HYPH 10_1101-2021_02_13_429885 426 31 fragmentation fragmentation NN 10_1101-2021_02_13_429885 426 32 test test NN 10_1101-2021_02_13_429885 426 33 . . . 
10_1101-2021_02_13_429885 430 1 Main Main NNP 10_1101-2021_02_13_429885 430 2 Text Text NNP 10_1101-2021_02_13_429885 430 3 Figures Figures NNPS 10_1101-2021_02_13_429885 430 4 Figure Figure NNP 10_1101-2021_02_13_429885 430 5 1 1 CD 10_1101-2021_02_13_429885 430 6 . . . 10_1101-2021_02_13_429885 430 7 a. a. NN 10_1101-2021_02_13_429885 431 1 Theoretical Theoretical NNP 10_1101-2021_02_13_429885 431 2 VAF VAF NNP 10_1101-2021_02_13_429885 431 3 histogram histogram NNP 10_1101-2021_02_13_429885 431 4 for for IN 10_1101-2021_02_13_429885 431 5 diploid diploid NNP 10_1101-2021_02_13_429885 431 6 1:1 1:1 NNP 10_1101-2021_02_13_429885 431 7 mutations mutation NNS 10_1101-2021_02_13_429885 431 8 in in IN 10_1101-2021_02_13_429885 431 9 a a DT 10_1101-2021_02_13_429885 431 10 tumour tumour NN 10_1101-2021_02_13_429885 431 11 . . . 
10_1101-2021_02_13_429885 432 1 A a DT 10_1101-2021_02_13_429885 432 2 clonal clonal JJ 10_1101-2021_02_13_429885 432 3 heterozygous heterozygous JJ 10_1101-2021_02_13_429885 432 4 mutation mutation NN 10_1101-2021_02_13_429885 432 5 has have VBZ 10_1101-2021_02_13_429885 432 6 50 50 CD 10_1101-2021_02_13_429885 432 7 % % NN 10_1101-2021_02_13_429885 432 8 VAF VAF NNP 10_1101-2021_02_13_429885 432 9 ; ; : 10_1101-2021_02_13_429885 432 10 all all DT 10_1101-2021_02_13_429885 432 11 mutations mutation NNS 10_1101-2021_02_13_429885 432 12 are be VBP 10_1101-2021_02_13_429885 432 13 observed observe VBN 10_1101-2021_02_13_429885 432 14 with with IN 10_1101-2021_02_13_429885 432 15 some some DT 10_1101-2021_02_13_429885 432 16 Binomial Binomial NNP 10_1101-2021_02_13_429885 432 17 sequencing sequencing NN 10_1101-2021_02_13_429885 432 18 noise noise NN 10_1101-2021_02_13_429885 432 19 . . . 10_1101-2021_02_13_429885 433 1 The the DT 10_1101-2021_02_13_429885 433 2 clonal clonal JJ 10_1101-2021_02_13_429885 433 3 mutations mutation NNS 10_1101-2021_02_13_429885 433 4 form form VBP 10_1101-2021_02_13_429885 433 5 a a DT 10_1101-2021_02_13_429885 433 6 peak peak NN 10_1101-2021_02_13_429885 433 7 at at IN 10_1101-2021_02_13_429885 433 8 100 100 CD 10_1101-2021_02_13_429885 433 9 % % NN 10_1101-2021_02_13_429885 433 10 CCF CCF NNP 10_1101-2021_02_13_429885 433 11 , , , 10_1101-2021_02_13_429885 433 12 plus plus CC 10_1101-2021_02_13_429885 433 13 other other JJ 10_1101-2021_02_13_429885 433 14 features feature NNS 10_1101-2021_02_13_429885 433 15 that that WDT 10_1101-2021_02_13_429885 433 16 characterise characterise VBP 10_1101-2021_02_13_429885 433 17 the the DT 10_1101-2021_02_13_429885 433 18 tumour tumour NN 10_1101-2021_02_13_429885 433 19 clonal clonal JJ 10_1101-2021_02_13_429885 433 20 composition composition NN 10_1101-2021_02_13_429885 433 21 ( ( -LRB- 10_1101-2021_02_13_429885 433 22 e.g. e.g. 
RB 10_1101-2021_02_13_429885 433 23 , , , 10_1101-2021_02_13_429885 433 24 the the DT 10_1101-2021_02_13_429885 433 25 tail tail NN 10_1101-2021_02_13_429885 433 26 ) ) -RRB- 10_1101-2021_02_13_429885 433 27 . . . 10_1101-2021_02_13_429885 434 1 The the DT 10_1101-2021_02_13_429885 434 2 expected expect VBN 10_1101-2021_02_13_429885 434 3 theoretical theoretical JJ 10_1101-2021_02_13_429885 434 4 VAF VAF NNP 10_1101-2021_02_13_429885 434 5 decreases decrease VBZ 10_1101-2021_02_13_429885 434 6 if if IN 10_1101-2021_02_13_429885 434 7 sample sample NN 10_1101-2021_02_13_429885 434 8 purity purity NN 10_1101-2021_02_13_429885 434 9 reduces reduce VBZ 10_1101-2021_02_13_429885 434 10 . . . 10_1101-2021_02_13_429885 435 1 b b NNP 10_1101-2021_02_13_429885 435 2 . . . 10_1101-2021_02_13_429885 436 1 The the DT 10_1101-2021_02_13_429885 436 2 case case NN 10_1101-2021_02_13_429885 436 3 of of IN 10_1101-2021_02_13_429885 436 4 a a DT 10_1101-2021_02_13_429885 436 5 2:1 2:1 CD 10_1101-2021_02_13_429885 436 6 tumour tumour NN 10_1101-2021_02_13_429885 436 7 genome genome NN 10_1101-2021_02_13_429885 436 8 , , , 10_1101-2021_02_13_429885 436 9 where where WRB 10_1101-2021_02_13_429885 436 10 we -PRON- PRP 10_1101-2021_02_13_429885 436 11 expect expect VBP 10_1101-2021_02_13_429885 436 12 2 2 CD 10_1101-2021_02_13_429885 436 13 peaks peak NNS 10_1101-2021_02_13_429885 436 14 in in IN 10_1101-2021_02_13_429885 436 15 the the DT 10_1101-2021_02_13_429885 436 16 VAF VAF NNP 10_1101-2021_02_13_429885 436 17 originating originate VBG 10_1101-2021_02_13_429885 436 18 from from IN 10_1101-2021_02_13_429885 436 19 mutations mutation NNS 10_1101-2021_02_13_429885 436 20 present present JJ 10_1101-2021_02_13_429885 436 21 in in IN 10_1101-2021_02_13_429885 436 22 one one CD 10_1101-2021_02_13_429885 436 23 ( ( -LRB- 10_1101-2021_02_13_429885 436 24 orange orange NN 10_1101-2021_02_13_429885 436 25 ) ) -RRB- 10_1101-2021_02_13_429885 436 26 or or CC 10_1101-2021_02_13_429885 436 27 
two two CD 10_1101-2021_02_13_429885 436 28 copies copy NNS 10_1101-2021_02_13_429885 436 29 ( ( -LRB- 10_1101-2021_02_13_429885 436 30 purple purple NNP 10_1101-2021_02_13_429885 436 31 ) ) -RRB- 10_1101-2021_02_13_429885 436 32 . . . 10_1101-2021_02_13_429885 437 1 The the DT 10_1101-2021_02_13_429885 437 2 multiplicity multiplicity NN 10_1101-2021_02_13_429885 437 3 of of IN 10_1101-2021_02_13_429885 437 4 a a DT 10_1101-2021_02_13_429885 437 5 mutation mutation NN 10_1101-2021_02_13_429885 437 6 can can MD 10_1101-2021_02_13_429885 437 7 phase phase VB 10_1101-2021_02_13_429885 437 8 whether whether IN 10_1101-2021_02_13_429885 437 9 it -PRON- PRP 10_1101-2021_02_13_429885 437 10 happened happen VBD 10_1101-2021_02_13_429885 437 11 before before IN 10_1101-2021_02_13_429885 437 12 or or CC 10_1101-2021_02_13_429885 437 13 after after IN 10_1101-2021_02_13_429885 437 14 the the DT 10_1101-2021_02_13_429885 437 15 CNA CNA NNP 10_1101-2021_02_13_429885 437 16 . . . 10_1101-2021_02_13_429885 438 1 For for IN 10_1101-2021_02_13_429885 438 2 2:1 2:1 CD 10_1101-2021_02_13_429885 438 3 we -PRON- PRP 10_1101-2021_02_13_429885 438 4 expect expect VBP 10_1101-2021_02_13_429885 438 5 peaks peak NNS 10_1101-2021_02_13_429885 438 6 at at IN 10_1101-2021_02_13_429885 438 7 66 66 CD 10_1101-2021_02_13_429885 438 8 % % NN 10_1101-2021_02_13_429885 438 9 and and CC 10_1101-2021_02_13_429885 438 10 33 33 CD 10_1101-2021_02_13_429885 438 11 % % NN 10_1101-2021_02_13_429885 438 12 VAF VAF NNP 10_1101-2021_02_13_429885 438 13 , , , 10_1101-2021_02_13_429885 438 14 both both CC 10_1101-2021_02_13_429885 438 15 clonal clonal JJ 10_1101-2021_02_13_429885 438 16 mutations mutation NNS 10_1101-2021_02_13_429885 438 17 ( ( -LRB- 10_1101-2021_02_13_429885 438 18 100 100 CD 10_1101-2021_02_13_429885 438 19 % % NN 10_1101-2021_02_13_429885 438 20 CCF CCF NNP 10_1101-2021_02_13_429885 438 21 ) ) -RRB- 10_1101-2021_02_13_429885 438 22 . . . 
10_1101-2021_02_13_429885 439 1 c c NNP 10_1101-2021_02_13_429885 439 2 . . . 10_1101-2021_02_13_429885 440 1 Computing compute VBG 10_1101-2021_02_13_429885 440 2 CCFs ccf NNS 10_1101-2021_02_13_429885 440 3 requires require VBZ 10_1101-2021_02_13_429885 440 4 caution caution NN 10_1101-2021_02_13_429885 440 5 for for IN 10_1101-2021_02_13_429885 440 6 mutations mutation NNS 10_1101-2021_02_13_429885 440 7 with with IN 10_1101-2021_02_13_429885 440 8 different different JJ 10_1101-2021_02_13_429885 440 9 multiplicities multiplicity NNS 10_1101-2021_02_13_429885 440 10 ; ; : 10_1101-2021_02_13_429885 440 11 we -PRON- PRP 10_1101-2021_02_13_429885 440 12 support support VBP 10_1101-2021_02_13_429885 440 13 2:0 2:0 CD 10_1101-2021_02_13_429885 440 14 , , , 10_1101-2021_02_13_429885 440 15 2:1 2:1 CD 10_1101-2021_02_13_429885 440 16 and and CC 10_1101-2021_02_13_429885 440 17 2:2 2:2 CD 10_1101-2021_02_13_429885 440 18 copy copy NN 10_1101-2021_02_13_429885 440 19 states state NNS 10_1101-2021_02_13_429885 440 20 in in IN 10_1101-2021_02_13_429885 440 21 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 440 22 , , , 10_1101-2021_02_13_429885 440 23 and and CC 10_1101-2021_02_13_429885 440 24 offer offer VB 10_1101-2021_02_13_429885 440 25 two two CD 10_1101-2021_02_13_429885 440 26 methods method NNS 10_1101-2021_02_13_429885 440 27 to to TO 10_1101-2021_02_13_429885 440 28 compute compute VB 10_1101-2021_02_13_429885 440 29 CCFs ccf NNS 10_1101-2021_02_13_429885 440 30 . . . 
10_1101-2021_02_13_429885 441 1 The the DT 10_1101-2021_02_13_429885 441 2 one one NN 10_1101-2021_02_13_429885 441 3 depicted depict VBN 10_1101-2021_02_13_429885 441 4 is be VBZ 10_1101-2021_02_13_429885 441 5 based base VBN 10_1101-2021_02_13_429885 441 6 on on IN 10_1101-2021_02_13_429885 441 7 the the DT 10_1101-2021_02_13_429885 441 8 entropy entropy NN 10_1101-2021_02_13_429885 441 9 of of IN 10_1101-2021_02_13_429885 441 10 a a DT 10_1101-2021_02_13_429885 441 11 Binomial Binomial NNP 10_1101-2021_02_13_429885 441 12 mixture mixture NN 10_1101-2021_02_13_429885 441 13 . . . 10_1101-2021_02_13_429885 442 1 From from IN 10_1101-2021_02_13_429885 442 2 the the DT 10_1101-2021_02_13_429885 442 3 expected expect VBN 10_1101-2021_02_13_429885 442 4 VAF VAF NNP 10_1101-2021_02_13_429885 442 5 peaks peak NNS 10_1101-2021_02_13_429885 442 6 we -PRON- PRP 10_1101-2021_02_13_429885 442 7 construct construct VBP 10_1101-2021_02_13_429885 442 8 a a DT 10_1101-2021_02_13_429885 442 9 mixture mixture NN 10_1101-2021_02_13_429885 442 10 density density NN 10_1101-2021_02_13_429885 442 11 and and CC 10_1101-2021_02_13_429885 442 12 use use VB 10_1101-2021_02_13_429885 442 13 the the DT 10_1101-2021_02_13_429885 442 14 entropy entropy NN 10_1101-2021_02_13_429885 442 15 of of IN 10_1101-2021_02_13_429885 442 16 its -PRON- PRP$ 10_1101-2021_02_13_429885 442 17 latent latent NN 10_1101-2021_02_13_429885 442 18 variables variable NNS 10_1101-2021_02_13_429885 442 19 to to TO 10_1101-2021_02_13_429885 442 20 capture capture VB 10_1101-2021_02_13_429885 442 21 uncertainty uncertainty NN 10_1101-2021_02_13_429885 442 22 in in IN 10_1101-2021_02_13_429885 442 23 the the DT 10_1101-2021_02_13_429885 442 24 multiplicities multiplicity NNS 10_1101-2021_02_13_429885 442 25 . . . 
10_1101-2021_02_13_429885 443 1 At at IN 10_1101-2021_02_13_429885 443 2 the the DT 10_1101-2021_02_13_429885 443 3 crossing crossing NN 10_1101-2021_02_13_429885 443 4 of of IN 10_1101-2021_02_13_429885 443 5 the the DT 10_1101-2021_02_13_429885 443 6 components component NNS 10_1101-2021_02_13_429885 443 7 we -PRON- PRP 10_1101-2021_02_13_429885 443 8 can can MD 10_1101-2021_02_13_429885 443 9 not not RB 10_1101-2021_02_13_429885 443 10 easily easily RB 10_1101-2021_02_13_429885 443 11 assign assign VB 10_1101-2021_02_13_429885 443 12 multiplicities multiplicity NNS 10_1101-2021_02_13_429885 443 13 , , , 10_1101-2021_02_13_429885 443 14 and and CC 10_1101-2021_02_13_429885 443 15 therefore therefore RB 10_1101-2021_02_13_429885 443 16 CCFs ccf NNS 10_1101-2021_02_13_429885 443 17 ; ; : 10_1101-2021_02_13_429885 443 18 the the DT 10_1101-2021_02_13_429885 443 19 entropy entropy JJ 10_1101-2021_02_13_429885 443 20 peaks peak NNS 10_1101-2021_02_13_429885 443 21 at at IN 10_1101-2021_02_13_429885 443 22 the the DT 10_1101-2021_02_13_429885 443 23 top top NN 10_1101-2021_02_13_429885 443 24 of of IN 10_1101-2021_02_13_429885 443 25 the the DT 10_1101-2021_02_13_429885 443 26 uncertainty uncertainty NN 10_1101-2021_02_13_429885 443 27 by by IN 10_1101-2021_02_13_429885 443 28 definition definition NN 10_1101-2021_02_13_429885 443 29 . . . 10_1101-2021_02_13_429885 444 1 d d LS 10_1101-2021_02_13_429885 444 2 . . . 
10_1101-2021_02_13_429885 445 1 Heatmap Heatmap NNP 10_1101-2021_02_13_429885 445 2 expressing express VBG 10_1101-2021_02_13_429885 445 3 the the DT 10_1101-2021_02_13_429885 445 4 relationship relationship NN 10_1101-2021_02_13_429885 445 5 between between IN 10_1101-2021_02_13_429885 445 6 copy copy NN 10_1101-2021_02_13_429885 445 7 states state NNS 10_1101-2021_02_13_429885 445 8 , , , 10_1101-2021_02_13_429885 445 9 mutation mutation NN 10_1101-2021_02_13_429885 445 10 multiplicity multiplicity NN 10_1101-2021_02_13_429885 445 11 and and CC 10_1101-2021_02_13_429885 445 12 sample sample NN 10_1101-2021_02_13_429885 445 13 purity purity NN 10_1101-2021_02_13_429885 445 14 . . . 10_1101-2021_02_13_429885 446 1 The the DT 10_1101-2021_02_13_429885 446 2 color color NN 10_1101-2021_02_13_429885 446 3 reflects reflect VBZ 10_1101-2021_02_13_429885 446 4 the the DT 10_1101-2021_02_13_429885 446 5 expected expect VBN 10_1101-2021_02_13_429885 446 6 VAF VAF NNP 10_1101-2021_02_13_429885 446 7 for for IN 10_1101-2021_02_13_429885 446 8 the the DT 10_1101-2021_02_13_429885 446 9 corresponding corresponding JJ 10_1101-2021_02_13_429885 446 10 mutations mutation NNS 10_1101-2021_02_13_429885 446 11 , , , 10_1101-2021_02_13_429885 446 12 and and CC 10_1101-2021_02_13_429885 446 13 can can MD 10_1101-2021_02_13_429885 446 14 be be VB 10_1101-2021_02_13_429885 446 15 used use VBN 10_1101-2021_02_13_429885 446 16 to to IN 10_1101-2021_02_13_429885 446 17 QC QC NNP 10_1101-2021_02_13_429885 446 18 both both DT 10_1101-2021_02_13_429885 446 19 CNAs cna NNS 10_1101-2021_02_13_429885 446 20 and and CC 10_1101-2021_02_13_429885 446 21 purity purity NN 10_1101-2021_02_13_429885 446 22 estimates estimate NNS 10_1101-2021_02_13_429885 446 23 . . . 
10_1101-2021_02_13_429885 450 1 Figure figure NN 10_1101-2021_02_13_429885 450 2 2 2 CD 10_1101-2021_02_13_429885 450 3 . . . 10_1101-2021_02_13_429885 450 4 a. a. 
NN 10_1101-2021_02_13_429885 450 5 Genome genome NN 10_1101-2021_02_13_429885 450 6 - - HYPH 10_1101-2021_02_13_429885 450 7 wide wide JJ 10_1101-2021_02_13_429885 450 8 total total JJ 10_1101-2021_02_13_429885 450 9 clonal clonal JJ 10_1101-2021_02_13_429885 450 10 copy copy NN 10_1101-2021_02_13_429885 450 11 number number NN 10_1101-2021_02_13_429885 450 12 segments segment NNS 10_1101-2021_02_13_429885 450 13 for for IN 10_1101-2021_02_13_429885 450 14 a a DT 10_1101-2021_02_13_429885 450 15 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 450 16 cancer cancer NN 10_1101-2021_02_13_429885 450 17 sample sample NN 10_1101-2021_02_13_429885 450 18 with with IN 10_1101-2021_02_13_429885 450 19 overall overall JJ 10_1101-2021_02_13_429885 450 20 ploidy ploidy NN 10_1101-2021_02_13_429885 450 21 2 2 CD 10_1101-2021_02_13_429885 450 22 , , , 10_1101-2021_02_13_429885 450 23 and and CC 10_1101-2021_02_13_429885 450 24 sample sample VB 10_1101-2021_02_13_429885 450 25 purity purity NN 10_1101-2021_02_13_429885 450 26 ~85 ~85 NFP 10_1101-2021_02_13_429885 450 27 % % NN 10_1101-2021_02_13_429885 450 28 . . . 10_1101-2021_02_13_429885 451 1 The the DT 10_1101-2021_02_13_429885 451 2 panel panel NN 10_1101-2021_02_13_429885 451 3 is be VBZ 10_1101-2021_02_13_429885 451 4 composed compose VBN 10_1101-2021_02_13_429885 451 5 of of IN 10_1101-2021_02_13_429885 451 6 three three CD 10_1101-2021_02_13_429885 451 7 illustrations illustration NNS 10_1101-2021_02_13_429885 451 8 . . . 
10_1101-2021_02_13_429885 452 1 The the DT 10_1101-2021_02_13_429885 452 2 bottom bottom JJ 10_1101-2021_02_13_429885 452 3 plot plot NN 10_1101-2021_02_13_429885 452 4 reports report VBZ 10_1101-2021_02_13_429885 452 5 the the DT 10_1101-2021_02_13_429885 452 6 copies copy NNS 10_1101-2021_02_13_429885 452 7 of of IN 10_1101-2021_02_13_429885 452 8 the the DT 10_1101-2021_02_13_429885 452 9 major major JJ 10_1101-2021_02_13_429885 452 10 and and CC 10_1101-2021_02_13_429885 452 11 minor minor JJ 10_1101-2021_02_13_429885 452 12 alleles allele NNS 10_1101-2021_02_13_429885 452 13 in in IN 10_1101-2021_02_13_429885 452 14 each each DT 10_1101-2021_02_13_429885 452 15 segment segment NN 10_1101-2021_02_13_429885 452 16 , , , 10_1101-2021_02_13_429885 452 17 and and CC 10_1101-2021_02_13_429885 452 18 some some DT 10_1101-2021_02_13_429885 452 19 genome genome JJ 10_1101-2021_02_13_429885 452 20 areas area NNS 10_1101-2021_02_13_429885 452 21 are be VBP 10_1101-2021_02_13_429885 452 22 shaded shaded JJ 10_1101-2021_02_13_429885 452 23 . . . 
10_1101-2021_02_13_429885 453 1 The the DT 10_1101-2021_02_13_429885 453 2 central central JJ 10_1101-2021_02_13_429885 453 3 plot plot NN 10_1101-2021_02_13_429885 453 4 shows show VBZ 10_1101-2021_02_13_429885 453 5 genome genome JJ 10_1101-2021_02_13_429885 453 6 - - HYPH 10_1101-2021_02_13_429885 453 7 wide wide JJ 10_1101-2021_02_13_429885 453 8 somatic somatic JJ 10_1101-2021_02_13_429885 453 9 mutations mutation NNS 10_1101-2021_02_13_429885 453 10 with with IN 10_1101-2021_02_13_429885 453 11 their -PRON- PRP$ 10_1101-2021_02_13_429885 453 12 depth depth NN 10_1101-2021_02_13_429885 453 13 of of IN 10_1101-2021_02_13_429885 453 14 sequencing sequencing NN 10_1101-2021_02_13_429885 453 15 , , , 10_1101-2021_02_13_429885 453 16 and and CC 10_1101-2021_02_13_429885 453 17 the the DT 10_1101-2021_02_13_429885 453 18 top top JJ 10_1101-2021_02_13_429885 453 19 plot plot NN 10_1101-2021_02_13_429885 453 20 shows show VBZ 10_1101-2021_02_13_429885 453 21 the the DT 10_1101-2021_02_13_429885 453 22 total total JJ 10_1101-2021_02_13_429885 453 23 number number NN 10_1101-2021_02_13_429885 453 24 of of IN 10_1101-2021_02_13_429885 453 25 mappable mappable JJ 10_1101-2021_02_13_429885 453 26 mutations mutation NNS 10_1101-2021_02_13_429885 453 27 binned bin VBD 10_1101-2021_02_13_429885 453 28 every every DT 10_1101-2021_02_13_429885 453 29 megabase megabase NN 10_1101-2021_02_13_429885 453 30 . . . 10_1101-2021_02_13_429885 454 1 b b NNP 10_1101-2021_02_13_429885 454 2 . . . 
10_1101-2021_02_13_429885 455 1 Variant Variant NNP 10_1101-2021_02_13_429885 455 2 Allele Allele NNP 10_1101-2021_02_13_429885 455 3 Frequencies Frequencies NNPS 10_1101-2021_02_13_429885 455 4 ( ( -LRB- 10_1101-2021_02_13_429885 455 5 VAFs VAFs NNP 10_1101-2021_02_13_429885 455 6 ) ) -RRB- 10_1101-2021_02_13_429885 455 7 for for IN 10_1101-2021_02_13_429885 455 8 the the DT 10_1101-2021_02_13_429885 455 9 mutations mutation NNS 10_1101-2021_02_13_429885 455 10 that that WDT 10_1101-2021_02_13_429885 455 11 map map VBP 10_1101-2021_02_13_429885 455 12 to to IN 10_1101-2021_02_13_429885 455 13 the the DT 10_1101-2021_02_13_429885 455 14 input input NN 10_1101-2021_02_13_429885 455 15 segments segment NNS 10_1101-2021_02_13_429885 455 16 ( ( -LRB- 10_1101-2021_02_13_429885 455 17 note note VB 10_1101-2021_02_13_429885 455 18 that that IN 10_1101-2021_02_13_429885 455 19 these these DT 10_1101-2021_02_13_429885 455 20 are be VBP 10_1101-2021_02_13_429885 455 21 all all DT 10_1101-2021_02_13_429885 455 22 SNVs SNVs NNPS 10_1101-2021_02_13_429885 455 23 ) ) -RRB- 10_1101-2021_02_13_429885 455 24 . . . 10_1101-2021_02_13_429885 456 1 c c NNP 10_1101-2021_02_13_429885 456 2 . . . 10_1101-2021_02_13_429885 457 1 Depth depth NN 10_1101-2021_02_13_429885 457 2 of of IN 10_1101-2021_02_13_429885 457 3 sequencing sequencing NN 10_1101-2021_02_13_429885 457 4 ( ( -LRB- 10_1101-2021_02_13_429885 457 5 DP DP NNP 10_1101-2021_02_13_429885 457 6 ) ) -RRB- 10_1101-2021_02_13_429885 457 7 for for IN 10_1101-2021_02_13_429885 457 8 every every DT 10_1101-2021_02_13_429885 457 9 SNV SNV NNP 10_1101-2021_02_13_429885 457 10 . . . 10_1101-2021_02_13_429885 458 1 d. d. 
NNP 10_1101-2021_02_13_429885 458 2 Number Number NNP 10_1101-2021_02_13_429885 458 3 of of IN 10_1101-2021_02_13_429885 458 4 reads read NNS 10_1101-2021_02_13_429885 458 5 ( ( -LRB- 10_1101-2021_02_13_429885 458 6 NV NV NNP 10_1101-2021_02_13_429885 458 7 ) ) -RRB- 10_1101-2021_02_13_429885 458 8 with with IN 10_1101-2021_02_13_429885 458 9 the the DT 10_1101-2021_02_13_429885 458 10 variant variant JJ 10_1101-2021_02_13_429885 458 11 allele allele NN 10_1101-2021_02_13_429885 458 12 for for IN 10_1101-2021_02_13_429885 458 13 every every DT 10_1101-2021_02_13_429885 458 14 SNV SNV NNP 10_1101-2021_02_13_429885 458 15 . . . 10_1101-2021_02_13_429885 459 1 e. e. NNP 10_1101-2021_02_13_429885 459 2 Cancer Cancer NNP 10_1101-2021_02_13_429885 459 3 Cell Cell NNP 10_1101-2021_02_13_429885 459 4 Fractions Fractions NNPS 10_1101-2021_02_13_429885 459 5 ( ( -LRB- 10_1101-2021_02_13_429885 459 6 CCF CCF NNP 10_1101-2021_02_13_429885 459 7 ) ) -RRB- 10_1101-2021_02_13_429885 459 8 estimation estimation NN 10_1101-2021_02_13_429885 459 9 for for IN 10_1101-2021_02_13_429885 459 10 this this DT 10_1101-2021_02_13_429885 459 11 sample sample NN 10_1101-2021_02_13_429885 459 12 , , , 10_1101-2021_02_13_429885 459 13 obtained obtain VBN 10_1101-2021_02_13_429885 459 14 from from IN 10_1101-2021_02_13_429885 459 15 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 459 16 . . . 
10_1101-2021_02_13_429885 463 1 Figure figure NN 10_1101-2021_02_13_429885 463 2 3 3 CD 10_1101-2021_02_13_429885 463 3 . . . 10_1101-2021_02_13_429885 463 4 a a DT 10_1101-2021_02_13_429885 463 5 - - HYPH 10_1101-2021_02_13_429885 463 6 d d NN 10_1101-2021_02_13_429885 463 7 . . . 
10_1101-2021_02_13_429885 464 1 Peak peak VB 10_1101-2021_02_13_429885 464 2 detection detection NN 10_1101-2021_02_13_429885 464 3 analysis analysis NN 10_1101-2021_02_13_429885 464 4 assessing assess VBG 10_1101-2021_02_13_429885 464 5 the the DT 10_1101-2021_02_13_429885 464 6 quality quality NN 10_1101-2021_02_13_429885 464 7 of of IN 10_1101-2021_02_13_429885 464 8 CNA CNA NNP 10_1101-2021_02_13_429885 464 9 segments segment NNS 10_1101-2021_02_13_429885 464 10 ( ( -LRB- 10_1101-2021_02_13_429885 464 11 split split VBN 10_1101-2021_02_13_429885 464 12 by by IN 10_1101-2021_02_13_429885 464 13 copy copy NN 10_1101-2021_02_13_429885 464 14 state state NN 10_1101-2021_02_13_429885 464 15 ) ) -RRB- 10_1101-2021_02_13_429885 464 16 , , , 10_1101-2021_02_13_429885 464 17 and and CC 10_1101-2021_02_13_429885 464 18 tumour tumour NN 10_1101-2021_02_13_429885 464 19 purity purity NN 10_1101-2021_02_13_429885 464 20 . . . 10_1101-2021_02_13_429885 465 1 The the DT 10_1101-2021_02_13_429885 465 2 shaded shaded JJ 10_1101-2021_02_13_429885 465 3 gray gray JJ 10_1101-2021_02_13_429885 465 4 area area NN 10_1101-2021_02_13_429885 465 5 are be VBP 10_1101-2021_02_13_429885 465 6 input input NN 10_1101-2021_02_13_429885 465 7 mutations mutation NNS 10_1101-2021_02_13_429885 465 8 , , , 10_1101-2021_02_13_429885 465 9 and and CC 10_1101-2021_02_13_429885 465 10 the the DT 10_1101-2021_02_13_429885 465 11 thin thin JJ 10_1101-2021_02_13_429885 465 12 black black JJ 10_1101-2021_02_13_429885 465 13 profile profile NN 10_1101-2021_02_13_429885 465 14 is be VBZ 10_1101-2021_02_13_429885 465 15 its -PRON- PRP$ 10_1101-2021_02_13_429885 465 16 kernel kernel NN 10_1101-2021_02_13_429885 465 17 density density NN 10_1101-2021_02_13_429885 465 18 estimation estimation NN 10_1101-2021_02_13_429885 465 19 ( ( -LRB- 10_1101-2021_02_13_429885 465 20 KDE KDE NNP 10_1101-2021_02_13_429885 465 21 ) ) -RRB- 10_1101-2021_02_13_429885 465 22 . . . 
10_1101-2021_02_13_429885 466 1 The the DT 10_1101-2021_02_13_429885 466 2 black black JJ 10_1101-2021_02_13_429885 466 3 circles circle NNS 10_1101-2021_02_13_429885 466 4 represent represent VBP 10_1101-2021_02_13_429885 466 5 the the DT 10_1101-2021_02_13_429885 466 6 peaks peak NNS 10_1101-2021_02_13_429885 466 7 detected detect VBN 10_1101-2021_02_13_429885 466 8 from from IN 10_1101-2021_02_13_429885 466 9 the the DT 10_1101-2021_02_13_429885 466 10 KDE KDE NNP 10_1101-2021_02_13_429885 466 11 , , , 10_1101-2021_02_13_429885 466 12 and and CC 10_1101-2021_02_13_429885 466 13 the the DT 10_1101-2021_02_13_429885 466 14 vertical vertical JJ 10_1101-2021_02_13_429885 466 15 dashed dash VBN 10_1101-2021_02_13_429885 466 16 lines line NNS 10_1101-2021_02_13_429885 466 17 are be VBP 10_1101-2021_02_13_429885 466 18 the the DT 10_1101-2021_02_13_429885 466 19 expected expect VBN 10_1101-2021_02_13_429885 466 20 peaks peak NNS 10_1101-2021_02_13_429885 466 21 , , , 10_1101-2021_02_13_429885 466 22 given give VBN 10_1101-2021_02_13_429885 466 23 the the DT 10_1101-2021_02_13_429885 466 24 tumour tumour NN 10_1101-2021_02_13_429885 466 25 purity purity NN 10_1101-2021_02_13_429885 466 26 . . . 
10_1101-2021_02_13_429885 467 1 If if IN 10_1101-2021_02_13_429885 467 2 the the DT 10_1101-2021_02_13_429885 467 3 data data NN 10_1101-2021_02_13_429885 467 4 peaks peak NNS 10_1101-2021_02_13_429885 467 5 fall fall VBP 10_1101-2021_02_13_429885 467 6 within within IN 10_1101-2021_02_13_429885 467 7 the the DT 10_1101-2021_02_13_429885 467 8 shaded shaded JJ 10_1101-2021_02_13_429885 467 9 area area NN 10_1101-2021_02_13_429885 467 10 surrounding surround VBG 10_1101-2021_02_13_429885 467 11 the the DT 10_1101-2021_02_13_429885 467 12 vertical vertical JJ 10_1101-2021_02_13_429885 467 13 line line NN 10_1101-2021_02_13_429885 467 14 , , , 10_1101-2021_02_13_429885 467 15 the the DT 10_1101-2021_02_13_429885 467 16 estimates estimate NNS 10_1101-2021_02_13_429885 467 17 are be VBP 10_1101-2021_02_13_429885 467 18 consistent consistent JJ 10_1101-2021_02_13_429885 467 19 and and CC 10_1101-2021_02_13_429885 467 20 the the DT 10_1101-2021_02_13_429885 467 21 plot plot NN 10_1101-2021_02_13_429885 467 22 is be VBZ 10_1101-2021_02_13_429885 467 23 therefore therefore RB 10_1101-2021_02_13_429885 467 24 green green JJ 10_1101-2021_02_13_429885 467 25 ( ( -LRB- 10_1101-2021_02_13_429885 467 26 QC QC NNP 10_1101-2021_02_13_429885 467 27 pass pass NN 10_1101-2021_02_13_429885 467 28 ) ) -RRB- 10_1101-2021_02_13_429885 467 29 . . . 
10_1101-2021_02_13_429885 468 1 For for IN 10_1101-2021_02_13_429885 468 2 copy copy NN 10_1101-2021_02_13_429885 468 3 states state NNS 10_1101-2021_02_13_429885 468 4 with with IN 10_1101-2021_02_13_429885 468 5 total total JJ 10_1101-2021_02_13_429885 468 6 copy copy NN 10_1101-2021_02_13_429885 468 7 number number NN 10_1101-2021_02_13_429885 468 8 > > NN 10_1101-2021_02_13_429885 468 9 2 2 CD 10_1101-2021_02_13_429885 468 10 , , , 10_1101-2021_02_13_429885 468 11 multiple multiple JJ 10_1101-2021_02_13_429885 468 12 peaks peak NNS 10_1101-2021_02_13_429885 468 13 are be VBP 10_1101-2021_02_13_429885 468 14 checked check VBN 10_1101-2021_02_13_429885 468 15 independently independently RB 10_1101-2021_02_13_429885 468 16 . . . 10_1101-2021_02_13_429885 469 1 In in IN 10_1101-2021_02_13_429885 469 2 that that DT 10_1101-2021_02_13_429885 469 3 case case NN 10_1101-2021_02_13_429885 469 4 the the DT 10_1101-2021_02_13_429885 469 5 overall overall JJ 10_1101-2021_02_13_429885 469 6 QC QC NNP 10_1101-2021_02_13_429885 469 7 status status NN 10_1101-2021_02_13_429885 469 8 for for IN 10_1101-2021_02_13_429885 469 9 the the DT 10_1101-2021_02_13_429885 469 10 copy copy NN 10_1101-2021_02_13_429885 469 11 state state NN 10_1101-2021_02_13_429885 469 12 is be VBZ 10_1101-2021_02_13_429885 469 13 a a DT 10_1101-2021_02_13_429885 469 14 linear linear JJ 10_1101-2021_02_13_429885 469 15 combination combination NN 10_1101-2021_02_13_429885 469 16 of of IN 10_1101-2021_02_13_429885 469 17 the the DT 10_1101-2021_02_13_429885 469 18 results result NNS 10_1101-2021_02_13_429885 469 19 , , , 10_1101-2021_02_13_429885 469 20 weighted weight VBN 10_1101-2021_02_13_429885 469 21 by by IN 10_1101-2021_02_13_429885 469 22 the the DT 10_1101-2021_02_13_429885 469 23 number number NN 10_1101-2021_02_13_429885 469 24 of of IN 10_1101-2021_02_13_429885 469 25 mutations mutation NNS 10_1101-2021_02_13_429885 469 26 assignable assignable JJ 10_1101-2021_02_13_429885 469 27 to to IN 
10_1101-2021_02_13_429885 469 28 each each DT 10_1101-2021_02_13_429885 469 29 peak peak NN 10_1101-2021_02_13_429885 469 30 . . . 10_1101-2021_02_13_429885 470 1 ​e ​e LS 10_1101-2021_02_13_429885 470 2 - - HYPH 10_1101-2021_02_13_429885 470 3 h h NN 10_1101-2021_02_13_429885 470 4 . . . 10_1101-2021_02_13_429885 470 5 Cancer Cancer NNP 10_1101-2021_02_13_429885 470 6 Cell Cell NNP 10_1101-2021_02_13_429885 470 7 Fractions Fractions NNPS 10_1101-2021_02_13_429885 470 8 ( ( -LRB- 10_1101-2021_02_13_429885 470 9 CCF CCF NNP 10_1101-2021_02_13_429885 470 10 ) ) -RRB- 10_1101-2021_02_13_429885 470 11 estimation estimation NN 10_1101-2021_02_13_429885 470 12 for for IN 10_1101-2021_02_13_429885 470 13 each each DT 10_1101-2021_02_13_429885 470 14 tumour tumour NN 10_1101-2021_02_13_429885 470 15 genome genome NN 10_1101-2021_02_13_429885 470 16 , , , 10_1101-2021_02_13_429885 470 17 using use VBG 10_1101-2021_02_13_429885 470 18 the the DT 10_1101-2021_02_13_429885 470 19 entropy entropy JJ 10_1101-2021_02_13_429885 470 20 method method NN 10_1101-2021_02_13_429885 470 21 . . . 10_1101-2021_02_13_429885 471 1 Each each DT 10_1101-2021_02_13_429885 471 2 plot plot NN 10_1101-2021_02_13_429885 471 3 shows show VBZ 10_1101-2021_02_13_429885 471 4 both both DT 10_1101-2021_02_13_429885 471 5 CCF ccf NN 10_1101-2021_02_13_429885 471 6 , , , 10_1101-2021_02_13_429885 471 7 and and CC 10_1101-2021_02_13_429885 471 8 the the DT 10_1101-2021_02_13_429885 471 9 VAF VAF NNP 10_1101-2021_02_13_429885 471 10 from from IN 10_1101-2021_02_13_429885 471 11 which which WDT 10_1101-2021_02_13_429885 471 12 mutation mutation NN 10_1101-2021_02_13_429885 471 13 multiplicities multiplicity NNS 10_1101-2021_02_13_429885 471 14 are be VBP 10_1101-2021_02_13_429885 471 15 computed compute VBN 10_1101-2021_02_13_429885 471 16 . . . 
10_1101-2021_02_13_429885 472 1 In in IN 10_1101-2021_02_13_429885 472 2 the the DT 10_1101-2021_02_13_429885 472 3 rightmost rightmost JJ 10_1101-2021_02_13_429885 472 4 panel panel NN 10_1101-2021_02_13_429885 472 5 we -PRON- PRP 10_1101-2021_02_13_429885 472 6 overlay overlay VBP 10_1101-2021_02_13_429885 472 7 the the DT 10_1101-2021_02_13_429885 472 8 entropy entropy JJ 10_1101-2021_02_13_429885 472 9 profile profile NN 10_1101-2021_02_13_429885 472 10 computed compute VBN 10_1101-2021_02_13_429885 472 11 by by IN 10_1101-2021_02_13_429885 472 12 a a DT 10_1101-2021_02_13_429885 472 13 2-dimensional 2-dimensional CD 10_1101-2021_02_13_429885 472 14 Binomial Binomial NNP 10_1101-2021_02_13_429885 472 15 mixture mixture NN 10_1101-2021_02_13_429885 472 16 . . . 10_1101-2021_02_13_429885 473 1 Areas area NNS 10_1101-2021_02_13_429885 473 2 within within IN 10_1101-2021_02_13_429885 473 3 the the DT 10_1101-2021_02_13_429885 473 4 red red JJ 10_1101-2021_02_13_429885 473 5 vertical vertical JJ 10_1101-2021_02_13_429885 473 6 dashed dash VBN 10_1101-2021_02_13_429885 473 7 lines line NNS 10_1101-2021_02_13_429885 473 8 are be VBP 10_1101-2021_02_13_429885 473 9 those those DT 10_1101-2021_02_13_429885 473 10 for for IN 10_1101-2021_02_13_429885 473 11 which which WDT 10_1101-2021_02_13_429885 473 12 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 473 13 can can MD 10_1101-2021_02_13_429885 473 14 not not RB 10_1101-2021_02_13_429885 473 15 assign assign VB 10_1101-2021_02_13_429885 473 16 a a DT 10_1101-2021_02_13_429885 473 17 confident confident JJ 10_1101-2021_02_13_429885 473 18 CCF ccf NN 10_1101-2021_02_13_429885 473 19 value value NN 10_1101-2021_02_13_429885 473 20 . . . 
10_1101-2021_02_13_429885 474 1 For for IN 10_1101-2021_02_13_429885 474 2 copy copy NN 10_1101-2021_02_13_429885 474 3 states state NNS 10_1101-2021_02_13_429885 474 4 1:0 1:0 CD 10_1101-2021_02_13_429885 474 5 and and CC 10_1101-2021_02_13_429885 474 6 1:1 1:1 CD 10_1101-2021_02_13_429885 474 7 the the DT 10_1101-2021_02_13_429885 474 8 mutation mutation NN 10_1101-2021_02_13_429885 474 9 multiplicity multiplicity NN 10_1101-2021_02_13_429885 474 10 is be VBZ 10_1101-2021_02_13_429885 474 11 fixed fix VBN 10_1101-2021_02_13_429885 474 12 to to IN 10_1101-2021_02_13_429885 474 13 1 1 CD 10_1101-2021_02_13_429885 474 14 by by IN 10_1101-2021_02_13_429885 474 15 definition definition NN 10_1101-2021_02_13_429885 474 16 . . . 10_1101-2021_02_13_429885 475 1 .CC .CC NFP 10_1101-2021_02_13_429885 475 2 - - : 10_1101-2021_02_13_429885 475 3 BY by IN 10_1101-2021_02_13_429885 475 4 - - HYPH 10_1101-2021_02_13_429885 475 5 NC NC NNP 10_1101-2021_02_13_429885 475 6 - - HYPH 10_1101-2021_02_13_429885 475 7 ND ND NNP 10_1101-2021_02_13_429885 475 8 4.0 4.0 CD 10_1101-2021_02_13_429885 475 9 International International NNP 10_1101-2021_02_13_429885 475 10 licenseavailable licenseavailable NN 10_1101-2021_02_13_429885 475 11 under under IN 10_1101-2021_02_13_429885 475 12 a a DT 10_1101-2021_02_13_429885 475 13 ( ( -LRB- 10_1101-2021_02_13_429885 475 14 which which WDT 10_1101-2021_02_13_429885 475 15 was be VBD 10_1101-2021_02_13_429885 475 16 not not RB 10_1101-2021_02_13_429885 475 17 certified certify VBN 10_1101-2021_02_13_429885 475 18 by by IN 10_1101-2021_02_13_429885 475 19 peer peer NN 10_1101-2021_02_13_429885 475 20 review review NN 10_1101-2021_02_13_429885 475 21 ) ) -RRB- 10_1101-2021_02_13_429885 475 22 is be VBZ 10_1101-2021_02_13_429885 475 23 the the DT 10_1101-2021_02_13_429885 475 24 author author NN 10_1101-2021_02_13_429885 475 25 / / SYM 10_1101-2021_02_13_429885 475 26 funder funder NN 10_1101-2021_02_13_429885 475 27 , , , 10_1101-2021_02_13_429885 
475 28 who who WP 10_1101-2021_02_13_429885 475 29 has have VBZ 10_1101-2021_02_13_429885 475 30 granted grant VBN 10_1101-2021_02_13_429885 475 31 bioRxiv biorxiv IN 10_1101-2021_02_13_429885 475 32 a a DT 10_1101-2021_02_13_429885 475 33 license license NN 10_1101-2021_02_13_429885 475 34 to to TO 10_1101-2021_02_13_429885 475 35 display display VB 10_1101-2021_02_13_429885 475 36 the the DT 10_1101-2021_02_13_429885 475 37 preprint preprint NN 10_1101-2021_02_13_429885 475 38 in in IN 10_1101-2021_02_13_429885 475 39 perpetuity perpetuity NN 10_1101-2021_02_13_429885 475 40 . . . 10_1101-2021_02_13_429885 476 1 It -PRON- PRP 10_1101-2021_02_13_429885 476 2 is be VBZ 10_1101-2021_02_13_429885 476 3 made make VBN 10_1101-2021_02_13_429885 476 4 The the DT 10_1101-2021_02_13_429885 476 5 copyright copyright NN 10_1101-2021_02_13_429885 476 6 holder holder NN 10_1101-2021_02_13_429885 476 7 for for IN 10_1101-2021_02_13_429885 476 8 this this DT 10_1101-2021_02_13_429885 476 9 preprintthis preprintthis NN 10_1101-2021_02_13_429885 476 10 version version NN 10_1101-2021_02_13_429885 476 11 posted post VBD 10_1101-2021_02_13_429885 476 12 February February NNP 10_1101-2021_02_13_429885 476 13 13 13 CD 10_1101-2021_02_13_429885 476 14 , , , 10_1101-2021_02_13_429885 476 15 2021 2021 CD 10_1101-2021_02_13_429885 476 16 . . . 
10_1101-2021_02_13_429885 476 17 ; ; : 10_1101-2021_02_13_429885 476 18 https://doi.org/10.1101/2021.02.13.429885doi https://doi.org/10.1101/2021.02.13.429885doi ADD 10_1101-2021_02_13_429885 476 19 : : : 10_1101-2021_02_13_429885 476 20 bioRxiv biorxiv VB 10_1101-2021_02_13_429885 476 21 preprint preprint NN 10_1101-2021_02_13_429885 476 22 https://doi.org/10.1101/2021.02.13.429885 https://doi.org/10.1101/2021.02.13.429885 ADD 10_1101-2021_02_13_429885 476 23 http://creativecommons.org/licenses/by-nc-nd/4.0/ http://creativecommons.org/licenses/by-nc-nd/4.0/ CD 10_1101-2021_02_13_429885 476 24 Househam Househam NNP 10_1101-2021_02_13_429885 476 25 et et FW 10_1101-2021_02_13_429885 476 26 al al NNP 10_1101-2021_02_13_429885 476 27 . . . 10_1101-2021_02_13_429885 477 1 A a DT 10_1101-2021_02_13_429885 477 2 fully fully RB 10_1101-2021_02_13_429885 477 3 automated automate VBN 10_1101-2021_02_13_429885 477 4 approach approach NN 10_1101-2021_02_13_429885 477 5 for for IN 10_1101-2021_02_13_429885 477 6 quality quality NN 10_1101-2021_02_13_429885 477 7 control control NN 10_1101-2021_02_13_429885 477 8 of of IN 10_1101-2021_02_13_429885 477 9 cancer cancer NN 10_1101-2021_02_13_429885 477 10 mutations mutation NNS 10_1101-2021_02_13_429885 477 11 in in IN 10_1101-2021_02_13_429885 477 12 the the DT 10_1101-2021_02_13_429885 477 13 era era NN 10_1101-2021_02_13_429885 477 14 of of IN 10_1101-2021_02_13_429885 477 15 high high JJ 10_1101-2021_02_13_429885 477 16 - - HYPH 10_1101-2021_02_13_429885 477 17 resolution resolution NN 10_1101-2021_02_13_429885 477 18 whole whole JJ 10_1101-2021_02_13_429885 477 19 genome genome JJ 10_1101-2021_02_13_429885 477 20 sequencing sequencing NN 10_1101-2021_02_13_429885 477 21 . . . 10_1101-2021_02_13_429885 478 1 Figure figure NN 10_1101-2021_02_13_429885 478 2 4 4 CD 10_1101-2021_02_13_429885 478 3 . . . 10_1101-2021_02_13_429885 478 4 a. a. 
NN 10_1101-2021_02_13_429885 478 5 Circos Circos NNP 10_1101-2021_02_13_429885 478 6 plot plot NN 10_1101-2021_02_13_429885 478 7 for for IN 10_1101-2021_02_13_429885 478 8 four four CD 10_1101-2021_02_13_429885 478 9 possible possible JJ 10_1101-2021_02_13_429885 478 10 whole whole RB 10_1101-2021_02_13_429885 478 11 - - HYPH 10_1101-2021_02_13_429885 478 12 genome genome JJ 10_1101-2021_02_13_429885 478 13 CNA cna NN 10_1101-2021_02_13_429885 478 14 segmentations segmentation NNS 10_1101-2021_02_13_429885 478 15 determined determine VBN 10_1101-2021_02_13_429885 478 16 by by IN 10_1101-2021_02_13_429885 478 17 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 478 18 with with IN 10_1101-2021_02_13_429885 478 19 WGS WGS NNP 10_1101-2021_02_13_429885 478 20 data datum NNS 10_1101-2021_02_13_429885 478 21 ( ( -LRB- 10_1101-2021_02_13_429885 478 22 ~80x ~80x NFP 10_1101-2021_02_13_429885 478 23 median median JJ 10_1101-2021_02_13_429885 478 24 coverage coverage NN 10_1101-2021_02_13_429885 478 25 , , , 10_1101-2021_02_13_429885 478 26 purity purity NN 10_1101-2021_02_13_429885 478 27 87 87 CD 10_1101-2021_02_13_429885 478 28 % % NN 10_1101-2021_02_13_429885 478 29 ) ) -RRB- 10_1101-2021_02_13_429885 478 30 . . . 
10_1101-2021_02_13_429885 479 1 The the DT 10_1101-2021_02_13_429885 479 2 input input NN 10_1101-2021_02_13_429885 479 3 sample sample NN 10_1101-2021_02_13_429885 479 4 is be VBZ 10_1101-2021_02_13_429885 479 5 Set7_57 Set7_57 NNP 10_1101-2021_02_13_429885 479 6 , , , 10_1101-2021_02_13_429885 479 7 one one CD 10_1101-2021_02_13_429885 479 8 of of IN 10_1101-2021_02_13_429885 479 9 four four CD 10_1101-2021_02_13_429885 479 10 multi multi JJ 10_1101-2021_02_13_429885 479 11 - - JJ 10_1101-2021_02_13_429885 479 12 region region JJ 10_1101-2021_02_13_429885 479 13 biopsies biopsy NNS 10_1101-2021_02_13_429885 479 14 for for IN 10_1101-2021_02_13_429885 479 15 colorectal colorectal JJ 10_1101-2021_02_13_429885 479 16 cancer cancer NN 10_1101-2021_02_13_429885 479 17 patient patient NN 10_1101-2021_02_13_429885 479 18 Set7 Set7 NNP 10_1101-2021_02_13_429885 479 19 . . . 10_1101-2021_02_13_429885 480 1 The the DT 10_1101-2021_02_13_429885 480 2 first first JJ 10_1101-2021_02_13_429885 480 3 run run NN 10_1101-2021_02_13_429885 480 4 is be VBZ 10_1101-2021_02_13_429885 480 5 with with IN 10_1101-2021_02_13_429885 480 6 default default JJ 10_1101-2021_02_13_429885 480 7 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 480 8 parameters parameter NNS 10_1101-2021_02_13_429885 480 9 . . . 
10_1101-2021_02_13_429885 481 1 With with IN 10_1101-2021_02_13_429885 481 2 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 481 3 , , , 10_1101-2021_02_13_429885 481 4 we -PRON- PRP 10_1101-2021_02_13_429885 481 5 slightly slightly RB 10_1101-2021_02_13_429885 481 6 adjust adjust VBP 10_1101-2021_02_13_429885 481 7 purity purity NN 10_1101-2021_02_13_429885 481 8 estimation estimation NN 10_1101-2021_02_13_429885 481 9 and and CC 10_1101-2021_02_13_429885 481 10 obtain obtain VB 10_1101-2021_02_13_429885 481 11 a a DT 10_1101-2021_02_13_429885 481 12 final final JJ 10_1101-2021_02_13_429885 481 13 run run NN 10_1101-2021_02_13_429885 481 14 of of IN 10_1101-2021_02_13_429885 481 15 the the DT 10_1101-2021_02_13_429885 481 16 tool tool NN 10_1101-2021_02_13_429885 481 17 . . . 10_1101-2021_02_13_429885 482 1 We -PRON- PRP 10_1101-2021_02_13_429885 482 2 also also RB 10_1101-2021_02_13_429885 482 3 one one CD 10_1101-2021_02_13_429885 482 4 run run NN 10_1101-2021_02_13_429885 482 5 forcing force VBG 10_1101-2021_02_13_429885 482 6 overall overall JJ 10_1101-2021_02_13_429885 482 7 tumour tumour NN 10_1101-2021_02_13_429885 482 8 ploidy ploidy NN 10_1101-2021_02_13_429885 482 9 to to IN 10_1101-2021_02_13_429885 482 10 4 4 CD 10_1101-2021_02_13_429885 482 11 ( ( -LRB- 10_1101-2021_02_13_429885 482 12 tetraploid tetraploid NN 10_1101-2021_02_13_429885 482 13 ) ) -RRB- 10_1101-2021_02_13_429885 482 14 , , , 10_1101-2021_02_13_429885 482 15 and and CC 10_1101-2021_02_13_429885 482 16 one one CD 10_1101-2021_02_13_429885 482 17 with with IN 10_1101-2021_02_13_429885 482 18 maximum maximum JJ 10_1101-2021_02_13_429885 482 19 tumour tumour NN 10_1101-2021_02_13_429885 482 20 purity purity NN 10_1101-2021_02_13_429885 482 21 60 60 CD 10_1101-2021_02_13_429885 482 22 % % NN 10_1101-2021_02_13_429885 482 23 . . . 10_1101-2021_02_13_429885 483 1 ​b ​b NNP 10_1101-2021_02_13_429885 483 2 . . . 
10_1101-2021_02_13_429885 484 1 Purity purity NN 10_1101-2021_02_13_429885 484 2 and and CC 10_1101-2021_02_13_429885 484 3 ploidy ploidy NN 10_1101-2021_02_13_429885 484 4 estimation estimation NN 10_1101-2021_02_13_429885 484 5 for for IN 10_1101-2021_02_13_429885 484 6 the the DT 10_1101-2021_02_13_429885 484 7 four four CD 10_1101-2021_02_13_429885 484 8 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 484 9 runs run NNS 10_1101-2021_02_13_429885 484 10 . . . 10_1101-2021_02_13_429885 485 1 Arrows arrow NNS 10_1101-2021_02_13_429885 485 2 show show VBP 10_1101-2021_02_13_429885 485 3 the the DT 10_1101-2021_02_13_429885 485 4 adjustment adjustment NN 10_1101-2021_02_13_429885 485 5 proposed propose VBN 10_1101-2021_02_13_429885 485 6 by by IN 10_1101-2021_02_13_429885 485 7 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 485 8 , , , 10_1101-2021_02_13_429885 485 9 the the DT 10_1101-2021_02_13_429885 485 10 default default NN 10_1101-2021_02_13_429885 485 11 and and CC 10_1101-2021_02_13_429885 485 12 final final JJ 10_1101-2021_02_13_429885 485 13 runs run NNS 10_1101-2021_02_13_429885 485 14 are be VBP 10_1101-2021_02_13_429885 485 15 the the DT 10_1101-2021_02_13_429885 485 16 only only JJ 10_1101-2021_02_13_429885 485 17 ones one NNS 10_1101-2021_02_13_429885 485 18 to to TO 10_1101-2021_02_13_429885 485 19 pass pass VB 10_1101-2021_02_13_429885 485 20 QC qc NN 10_1101-2021_02_13_429885 485 21 . . . 10_1101-2021_02_13_429885 486 1 ​c ​c NNP 10_1101-2021_02_13_429885 486 2 . . . 
10_1101-2021_02_13_429885 487 1 Final final JJ 10_1101-2021_02_13_429885 487 2 run run NN 10_1101-2021_02_13_429885 487 3 with with IN 10_1101-2021_02_13_429885 487 4 perfect perfect JJ 10_1101-2021_02_13_429885 487 5 results result NNS 10_1101-2021_02_13_429885 487 6 for for IN 10_1101-2021_02_13_429885 487 7 Set7_57 Set7_57 NNP 10_1101-2021_02_13_429885 487 8 : : : 10_1101-2021_02_13_429885 487 9 copy copy NN 10_1101-2021_02_13_429885 487 10 number number NN 10_1101-2021_02_13_429885 487 11 segments segment NNS 10_1101-2021_02_13_429885 487 12 , , , 10_1101-2021_02_13_429885 487 13 depth depth NN 10_1101-2021_02_13_429885 487 14 of of IN 10_1101-2021_02_13_429885 487 15 coverage coverage NN 10_1101-2021_02_13_429885 487 16 per per IN 10_1101-2021_02_13_429885 487 17 mutation mutation NN 10_1101-2021_02_13_429885 487 18 and and CC 10_1101-2021_02_13_429885 487 19 mutation mutation NN 10_1101-2021_02_13_429885 487 20 density density NN 10_1101-2021_02_13_429885 487 21 per per IN 10_1101-2021_02_13_429885 487 22 megabase megabase NN 10_1101-2021_02_13_429885 487 23 . . . 10_1101-2021_02_13_429885 488 1 ​d ​d LS 10_1101-2021_02_13_429885 488 2 . . . 10_1101-2021_02_13_429885 489 1 ​Miscalled ​miscalle VBN 10_1101-2021_02_13_429885 489 2 copy copy NN 10_1101-2021_02_13_429885 489 3 - - HYPH 10_1101-2021_02_13_429885 489 4 neutral neutral JJ 10_1101-2021_02_13_429885 489 5 LOH LOH NNP 10_1101-2021_02_13_429885 489 6 segment segment NN 10_1101-2021_02_13_429885 489 7 , , , 10_1101-2021_02_13_429885 489 8 obtained obtain VBN 10_1101-2021_02_13_429885 489 9 by by IN 10_1101-2021_02_13_429885 489 10 forcing force VBG 10_1101-2021_02_13_429885 489 11 a a DT 10_1101-2021_02_13_429885 489 12 tetraploid tetraploid NN 10_1101-2021_02_13_429885 489 13 solution solution NN 10_1101-2021_02_13_429885 489 14 in in IN 10_1101-2021_02_13_429885 489 15 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 489 16 . . . 
10_1101-2021_02_13_429885 490 1 For for IN 10_1101-2021_02_13_429885 490 2 a a DT 10_1101-2021_02_13_429885 490 3 2:0 2:0 CD 10_1101-2021_02_13_429885 490 4 segment segment NN 10_1101-2021_02_13_429885 490 5 with with IN 10_1101-2021_02_13_429885 490 6 the the DT 10_1101-2021_02_13_429885 490 7 estimated estimate VBN 10_1101-2021_02_13_429885 490 8 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 493 1 purity purity NN 10_1101-2021_02_13_429885 493 2 we -PRON- PRP 10_1101-2021_02_13_429885 493 3 expected expect VBD 10_1101-2021_02_13_429885 493 4 peaks peak NNS 10_1101-2021_02_13_429885 493 5 at at IN 10_1101-2021_02_13_429885 493 6 ~60 ~60 NFP 10_1101-2021_02_13_429885 493 7 % % NN 10_1101-2021_02_13_429885 493 8 and and CC 10_1101-2021_02_13_429885 493 9 ~30 ~30 : 10_1101-2021_02_13_429885 493 10 % % NN 10_1101-2021_02_13_429885 493 11 VAF VAF NNP 10_1101-2021_02_13_429885 493 12 , , , 10_1101-2021_02_13_429885 493 13 which which WDT 10_1101-2021_02_13_429885 493 14 can can MD 10_1101-2021_02_13_429885 493 15 not not RB 10_1101-2021_02_13_429885 493 16 be be VB 10_1101-2021_02_13_429885 493 17 matched match VBN 10_1101-2021_02_13_429885 493 18 . . . 10_1101-2021_02_13_429885 494 1 e e LS 10_1101-2021_02_13_429885 494 2 . . . 
10_1101-2021_02_13_429885 495 1 CNA CNA NNP 10_1101-2021_02_13_429885 495 2 calling call VBG 10_1101-2021_02_13_429885 495 3 with with IN 10_1101-2021_02_13_429885 495 4 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 495 5 and and CC 10_1101-2021_02_13_429885 495 6 Sequenza Sequenza NNP 10_1101-2021_02_13_429885 495 7 for for IN 10_1101-2021_02_13_429885 495 8 4 4 CD 10_1101-2021_02_13_429885 495 9 WGS WGS NNP 10_1101-2021_02_13_429885 495 10 biopsies biopsy NNS 10_1101-2021_02_13_429885 495 11 of of IN 10_1101-2021_02_13_429885 495 12 the the DT 10_1101-2021_02_13_429885 495 13 primary primary JJ 10_1101-2021_02_13_429885 495 14 colorectal colorectal JJ 10_1101-2021_02_13_429885 495 15 cancer cancer NN 10_1101-2021_02_13_429885 495 16 Set7 Set7 NNP 10_1101-2021_02_13_429885 495 17 . . . 10_1101-2021_02_13_429885 496 1 Figure figure NN 10_1101-2021_02_13_429885 496 2 5 5 CD 10_1101-2021_02_13_429885 496 3 . . . 10_1101-2021_02_13_429885 496 4 a. a. NN 10_1101-2021_02_13_429885 497 1 Summary Summary NNP 10_1101-2021_02_13_429885 497 2 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 497 3 pass pass VBP 10_1101-2021_02_13_429885 497 4 or or CC 10_1101-2021_02_13_429885 497 5 fail fail VBP 10_1101-2021_02_13_429885 497 6 barplot barplot NN 10_1101-2021_02_13_429885 497 7 for for IN 10_1101-2021_02_13_429885 497 8 top top JJ 10_1101-2021_02_13_429885 497 9 - - HYPH 10_1101-2021_02_13_429885 497 10 quality quality NN 10_1101-2021_02_13_429885 497 11 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 497 12 samples sample NNS 10_1101-2021_02_13_429885 497 13 065 065 CD 10_1101-2021_02_13_429885 497 14 n n CC 10_1101-2021_02_13_429885 497 15 = = SYM 10_1101-2021_02_13_429885 497 16 1 1 CD 10_1101-2021_02_13_429885 497 17 across across IN 10_1101-2021_02_13_429885 497 18 distinct distinct JJ 10_1101-2021_02_13_429885 497 19 tumour tumour NN 10_1101-2021_02_13_429885 497 20 types type NNS 10_1101-2021_02_13_429885 497 21 . . . 
10_1101-2021_02_13_429885 498 1 Failures failure NNS 10_1101-2021_02_13_429885 498 2 for for IN 10_1101-2021_02_13_429885 498 3 peaks peak NNS 10_1101-2021_02_13_429885 498 4 are be VBP 10_1101-2021_02_13_429885 498 5 with with IN 10_1101-2021_02_13_429885 498 6 a a DT 10_1101-2021_02_13_429885 498 7 3 3 CD 10_1101-2021_02_13_429885 498 8 % % NN 10_1101-2021_02_13_429885 498 9 error error NN 10_1101-2021_02_13_429885 498 10 tolerance tolerance NN 10_1101-2021_02_13_429885 498 11 , , , 10_1101-2021_02_13_429885 498 12 and and CC 10_1101-2021_02_13_429885 498 13 CCFs ccf NNS 10_1101-2021_02_13_429885 498 14 with with IN 10_1101-2021_02_13_429885 498 15 10 10 CD 10_1101-2021_02_13_429885 498 16 % % NN 10_1101-2021_02_13_429885 498 17 of of IN 10_1101-2021_02_13_429885 498 18 SNVs SNVs NNPS 10_1101-2021_02_13_429885 498 19 not not RB 10_1101-2021_02_13_429885 498 20 assignable assignable JJ 10_1101-2021_02_13_429885 498 21 , , , 10_1101-2021_02_13_429885 498 22 per per IN 10_1101-2021_02_13_429885 498 23 copy copy NN 10_1101-2021_02_13_429885 498 24 state state NN 10_1101-2021_02_13_429885 498 25 . . . 10_1101-2021_02_13_429885 499 1 ​b ​b NNP 10_1101-2021_02_13_429885 499 2 . . . 
10_1101-2021_02_13_429885 500 1 Zoom Zoom NNP 10_1101-2021_02_13_429885 500 2 peak peak VBP 10_1101-2021_02_13_429885 500 3 analysis analysis NN 10_1101-2021_02_13_429885 500 4 with with IN 10_1101-2021_02_13_429885 500 5 a a DT 10_1101-2021_02_13_429885 500 6 scatter scatter NN 10_1101-2021_02_13_429885 500 7 showing showing NN 10_1101-2021_02_13_429885 500 8 , , , 10_1101-2021_02_13_429885 500 9 for for IN 10_1101-2021_02_13_429885 503 1 every every DT 10_1101-2021_02_13_429885 503 2 tumour tumour NN 10_1101-2021_02_13_429885 503 3 type type NN 10_1101-2021_02_13_429885 503 4 , , , 10_1101-2021_02_13_429885 503 5 the the DT 10_1101-2021_02_13_429885 503 6 total total JJ 10_1101-2021_02_13_429885 503 7 cases case NNS 10_1101-2021_02_13_429885 503 8 per per IN 10_1101-2021_02_13_429885 503 9 tumour tumour NN 10_1101-2021_02_13_429885 503 10 against against IN 10_1101-2021_02_13_429885 503 11 the the DT 10_1101-2021_02_13_429885 503 12 proportion proportion NN 10_1101-2021_02_13_429885 503 13 of of IN 10_1101-2021_02_13_429885 503 14 pass pass NN 10_1101-2021_02_13_429885 503 15 or or CC 10_1101-2021_02_13_429885 503 16 fails fail VBZ 10_1101-2021_02_13_429885 503 17 ; ; : 10_1101-2021_02_13_429885 503 18 each each DT 10_1101-2021_02_13_429885 503 19 dot dot NN 10_1101-2021_02_13_429885 503 20 size size NN 10_1101-2021_02_13_429885 503 21 is be VBZ 10_1101-2021_02_13_429885 503 22 
proportional proportional JJ 10_1101-2021_02_13_429885 503 23 to to IN 10_1101-2021_02_13_429885 503 24 the the DT 10_1101-2021_02_13_429885 503 25 error error NN 10_1101-2021_02_13_429885 503 26 measure measure NN 10_1101-2021_02_13_429885 503 27 from from IN 10_1101-2021_02_13_429885 503 28 mismatched mismatch VBN 10_1101-2021_02_13_429885 503 29 peaks peak NNS 10_1101-2021_02_13_429885 503 30 . . . 10_1101-2021_02_13_429885 507 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 507 2 Figures Figures NNPS 10_1101-2021_02_13_429885 507 3 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 507 4 Figure Figure NNP 10_1101-2021_02_13_429885 507 5 S1 S1 NNS 10_1101-2021_02_13_429885 507 6 . . . 10_1101-2021_02_13_429885 508 1 PCAWG pcawg ADD 10_1101-2021_02_13_429885 508 2 sample sample VB 10_1101-2021_02_13_429885 508 3 with with IN 10_1101-2021_02_13_429885 508 4 low low JJ 10_1101-2021_02_13_429885 508 5 mutational mutational JJ 10_1101-2021_02_13_429885 508 6 burden burden NN 10_1101-2021_02_13_429885 508 7 . . . 10_1101-2021_02_13_429885 509 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 509 2 Figure Figure NNP 10_1101-2021_02_13_429885 509 3 S2 s2 NN 10_1101-2021_02_13_429885 509 4 . . . 
10_1101-2021_02_13_429885 510 1 Sample sample PRP 10_1101-2021_02_13_429885 510 2 Set7_55 Set7_55 NNP 10_1101-2021_02_13_429885 510 3 ( ( -LRB- 10_1101-2021_02_13_429885 510 4 multi multi JJ 10_1101-2021_02_13_429885 510 5 - - JJ 10_1101-2021_02_13_429885 510 6 region region JJ 10_1101-2021_02_13_429885 510 7 ) ) -RRB- 10_1101-2021_02_13_429885 510 8 . . . 10_1101-2021_02_13_429885 511 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 511 2 Figure Figure NNP 10_1101-2021_02_13_429885 511 3 S3 S3 NNP 10_1101-2021_02_13_429885 511 4 . . . 10_1101-2021_02_13_429885 512 1 Sample Sample NNP 10_1101-2021_02_13_429885 512 2 Set7_59 Set7_59 NNP 10_1101-2021_02_13_429885 512 3 ( ( -LRB- 10_1101-2021_02_13_429885 512 4 multi multi JJ 10_1101-2021_02_13_429885 512 5 - - JJ 10_1101-2021_02_13_429885 512 6 region region JJ 10_1101-2021_02_13_429885 512 7 ) ) -RRB- 10_1101-2021_02_13_429885 512 8 . . . 10_1101-2021_02_13_429885 513 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 513 2 Figure Figure NNP 10_1101-2021_02_13_429885 513 3 S4 s4 NN 10_1101-2021_02_13_429885 513 4 . . . 10_1101-2021_02_13_429885 514 1 Sample sample CD 10_1101-2021_02_13_429885 514 2 Set7_62 Set7_62 NNP 10_1101-2021_02_13_429885 514 3 ( ( -LRB- 10_1101-2021_02_13_429885 514 4 multi multi JJ 10_1101-2021_02_13_429885 514 5 - - JJ 10_1101-2021_02_13_429885 514 6 region region JJ 10_1101-2021_02_13_429885 514 7 ) ) -RRB- 10_1101-2021_02_13_429885 514 8 . . . 10_1101-2021_02_13_429885 515 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 515 2 Figure Figure NNP 10_1101-2021_02_13_429885 515 3 S5 s5 NN 10_1101-2021_02_13_429885 515 4 . . . 
10_1101-2021_02_13_429885 516 1 Sample sample ADD 10_1101-2021_02_13_429885 516 2 Set6_42 Set6_42 NNP 10_1101-2021_02_13_429885 516 3 ( ( -LRB- 10_1101-2021_02_13_429885 516 4 multi multi JJ 10_1101-2021_02_13_429885 516 5 - - JJ 10_1101-2021_02_13_429885 516 6 region region JJ 10_1101-2021_02_13_429885 516 7 ) ) -RRB- 10_1101-2021_02_13_429885 516 8 . . . 10_1101-2021_02_13_429885 517 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 517 2 Figure Figure NNP 10_1101-2021_02_13_429885 517 3 S6 s6 NN 10_1101-2021_02_13_429885 517 4 . . . 10_1101-2021_02_13_429885 518 1 Sample sample PRP 10_1101-2021_02_13_429885 518 2 Set6_44 Set6_44 NNP 10_1101-2021_02_13_429885 518 3 ( ( -LRB- 10_1101-2021_02_13_429885 518 4 multi multi JJ 10_1101-2021_02_13_429885 518 5 - - JJ 10_1101-2021_02_13_429885 518 6 region region JJ 10_1101-2021_02_13_429885 518 7 ) ) -RRB- 10_1101-2021_02_13_429885 518 8 . . . 10_1101-2021_02_13_429885 519 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 519 2 Figure Figure NNP 10_1101-2021_02_13_429885 519 3 S7 S7 NNP 10_1101-2021_02_13_429885 519 4 . . . 10_1101-2021_02_13_429885 520 1 Sample sample PRP 10_1101-2021_02_13_429885 520 2 Set6_45 Set6_45 NNP 10_1101-2021_02_13_429885 520 3 ( ( -LRB- 10_1101-2021_02_13_429885 520 4 multi multi JJ 10_1101-2021_02_13_429885 520 5 - - JJ 10_1101-2021_02_13_429885 520 6 region region JJ 10_1101-2021_02_13_429885 520 7 ) ) -RRB- 10_1101-2021_02_13_429885 520 8 . . . 10_1101-2021_02_13_429885 521 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 521 2 Figure Figure NNP 10_1101-2021_02_13_429885 521 3 S8 S8 NNP 10_1101-2021_02_13_429885 521 4 . . . 
10_1101-2021_02_13_429885 522 1 Sample sample ADD 10_1101-2021_02_13_429885 522 2 Set6_46 Set6_46 NNP 10_1101-2021_02_13_429885 522 3 ( ( -LRB- 10_1101-2021_02_13_429885 522 4 multi multi JJ 10_1101-2021_02_13_429885 522 5 - - JJ 10_1101-2021_02_13_429885 522 6 region region JJ 10_1101-2021_02_13_429885 522 7 ) ) -RRB- 10_1101-2021_02_13_429885 522 8 . . . 10_1101-2021_02_13_429885 523 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 523 2 Figure Figure NNP 10_1101-2021_02_13_429885 523 3 S9 s9 NN 10_1101-2021_02_13_429885 523 4 . . . 10_1101-2021_02_13_429885 524 1 Sample Sample -LRB- 10_1101-2021_02_13_429885 524 2 Set6_47 Set6_47 NNP 10_1101-2021_02_13_429885 524 3 ( ( -LRB- 10_1101-2021_02_13_429885 524 4 multi multi JJ 10_1101-2021_02_13_429885 524 5 - - JJ 10_1101-2021_02_13_429885 524 6 region region JJ 10_1101-2021_02_13_429885 524 7 ) ) -RRB- 10_1101-2021_02_13_429885 524 8 . . . 10_1101-2021_02_13_429885 525 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 525 2 Figure Figure NNP 10_1101-2021_02_13_429885 525 3 S10 S10 NNP 10_1101-2021_02_13_429885 525 4 . . . 10_1101-2021_02_13_429885 526 1 Sample sample PRP 10_1101-2021_02_13_429885 526 2 Set6_48 Set6_48 NNP 10_1101-2021_02_13_429885 526 3 ( ( -LRB- 10_1101-2021_02_13_429885 526 4 multi multi JJ 10_1101-2021_02_13_429885 526 5 - - JJ 10_1101-2021_02_13_429885 526 6 region region JJ 10_1101-2021_02_13_429885 526 7 ) ) -RRB- 10_1101-2021_02_13_429885 526 8 . . . 10_1101-2021_02_13_429885 527 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 527 2 Figure Figure NNP 10_1101-2021_02_13_429885 527 3 S11 S11 NNP 10_1101-2021_02_13_429885 527 4 . . . 
10_1101-2021_02_13_429885 528 1 PCAWG pcawg ADD 10_1101-2021_02_13_429885 528 2 sample sample NNP 10_1101-2021_02_13_429885 528 3 with with IN 10_1101-2021_02_13_429885 528 4 overestimated overestimated JJ 10_1101-2021_02_13_429885 528 5 100 100 CD 10_1101-2021_02_13_429885 528 6 % % NN 10_1101-2021_02_13_429885 528 7 purity purity NN 10_1101-2021_02_13_429885 528 8 . . . 10_1101-2021_02_13_429885 529 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 529 2 Figure Figure NNP 10_1101-2021_02_13_429885 529 3 S12 S12 NNP 10_1101-2021_02_13_429885 529 4 . . . 10_1101-2021_02_13_429885 530 1 PCAWG pcawg ADD 10_1101-2021_02_13_429885 530 2 sample sample VB 10_1101-2021_02_13_429885 530 3 with with IN 10_1101-2021_02_13_429885 530 4 true true JJ 10_1101-2021_02_13_429885 530 5 99 99 CD 10_1101-2021_02_13_429885 530 6 % % NN 10_1101-2021_02_13_429885 530 7 purity purity NN 10_1101-2021_02_13_429885 530 8 . . . 
10_1101-2021_02_13_429885 534 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 534 2 Figure Figure NNP 10_1101-2021_02_13_429885 534 3 S1 S1 NNS 10_1101-2021_02_13_429885 534 4 . . . 
10_1101-2021_02_13_429885 535 1 Example Example NNP 10_1101-2021_02_13_429885 535 2 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 535 3 medulloblastoma medulloblastoma NN 10_1101-2021_02_13_429885 535 4 sample sample NN 10_1101-2021_02_13_429885 535 5 with with IN 10_1101-2021_02_13_429885 535 6 low low JJ 10_1101-2021_02_13_429885 535 7 - - HYPH 10_1101-2021_02_13_429885 535 8 mutational mutational JJ 10_1101-2021_02_13_429885 535 9 burden burden NN 10_1101-2021_02_13_429885 535 10 , , , 10_1101-2021_02_13_429885 535 11 which which WDT 10_1101-2021_02_13_429885 535 12 passes pass VBZ 10_1101-2021_02_13_429885 535 13 data data NN 10_1101-2021_02_13_429885 535 14 QC QC NNP 10_1101-2021_02_13_429885 535 15 with with IN 10_1101-2021_02_13_429885 535 16 CNAqc CNAqc NNP 10_1101-2021_02_13_429885 535 17 . . . 10_1101-2021_02_13_429885 536 1 a a NNS 10_1101-2021_02_13_429885 536 2 . . . 10_1101-2021_02_13_429885 537 1 Data Data NNP 10_1101-2021_02_13_429885 537 2 for for IN 10_1101-2021_02_13_429885 537 3 the the DT 10_1101-2021_02_13_429885 537 4 sample sample NN 10_1101-2021_02_13_429885 537 5 ( ( -LRB- 10_1101-2021_02_13_429885 537 6 genome genome NN 10_1101-2021_02_13_429885 537 7 - - HYPH 10_1101-2021_02_13_429885 537 8 wide wide JJ 10_1101-2021_02_13_429885 537 9 CNA cna NN 10_1101-2021_02_13_429885 537 10 segments segment NNS 10_1101-2021_02_13_429885 537 11 , , , 10_1101-2021_02_13_429885 537 12 CCF CCF NNP 10_1101-2021_02_13_429885 537 13 and and CC 10_1101-2021_02_13_429885 537 14 read read VBD 10_1101-2021_02_13_429885 537 15 counts count NNS 10_1101-2021_02_13_429885 537 16 distribution distribution NN 10_1101-2021_02_13_429885 537 17 ) ) -RRB- 10_1101-2021_02_13_429885 537 18 . . . 
10_1101-2021_02_13_429885 538 1 Note note VB 10_1101-2021_02_13_429885 538 2 that that IN 10_1101-2021_02_13_429885 538 3 this this DT 10_1101-2021_02_13_429885 538 4 sample sample NN 10_1101-2021_02_13_429885 538 5 has have VBZ 10_1101-2021_02_13_429885 538 6 only only RB 10_1101-2021_02_13_429885 538 7 76 76 CD 10_1101-2021_02_13_429885 538 8 SNVs SNVs NNPS 10_1101-2021_02_13_429885 538 9 in in IN 10_1101-2021_02_13_429885 538 10 diploid diploid JJ 10_1101-2021_02_13_429885 538 11 tumour tumour NN 10_1101-2021_02_13_429885 538 12 regions region NNS 10_1101-2021_02_13_429885 538 13 , , , 10_1101-2021_02_13_429885 538 14 like like IN 10_1101-2021_02_13_429885 538 15 we -PRON- PRP 10_1101-2021_02_13_429885 538 16 observe observe VBP 10_1101-2021_02_13_429885 538 17 in in IN 10_1101-2021_02_13_429885 538 18 whole whole NN 10_1101-2021_02_13_429885 538 19 - - HYPH 10_1101-2021_02_13_429885 538 20 exome exome NN 10_1101-2021_02_13_429885 538 21 assays assay NNS 10_1101-2021_02_13_429885 538 22 . . . 10_1101-2021_02_13_429885 539 1 b b NNP 10_1101-2021_02_13_429885 539 2 , , , 10_1101-2021_02_13_429885 539 3 c c NNP 10_1101-2021_02_13_429885 539 4 . . . 10_1101-2021_02_13_429885 540 1 Peak peak VB 10_1101-2021_02_13_429885 540 2 analysis analysis NN 10_1101-2021_02_13_429885 540 3 and and CC 10_1101-2021_02_13_429885 540 4 CCF ccf NN 10_1101-2021_02_13_429885 540 5 computation computation NN 10_1101-2021_02_13_429885 540 6 for for IN 10_1101-2021_02_13_429885 540 7 diploid diploid NNP 10_1101-2021_02_13_429885 540 8 SNVs SNVs NNPS 10_1101-2021_02_13_429885 540 9 . . . 
10_1101-2021_02_13_429885 544 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 544 2 Figure Figure NNP 10_1101-2021_02_13_429885 544 3 S2 s2 NN 10_1101-2021_02_13_429885 544 4 . . . 
10_1101-2021_02_13_429885 545 1 Colorectal colorectal JJ 10_1101-2021_02_13_429885 545 2 multi multi JJ 10_1101-2021_02_13_429885 545 3 - - JJ 10_1101-2021_02_13_429885 545 4 region region JJ 10_1101-2021_02_13_429885 545 5 sample sample NN 10_1101-2021_02_13_429885 545 6 Set7_55 Set7_55 NNP 10_1101-2021_02_13_429885 545 7 for for IN 10_1101-2021_02_13_429885 545 8 patient patient NN 10_1101-2021_02_13_429885 545 9 Set7 Set7 NNP 10_1101-2021_02_13_429885 545 10 ( ( -LRB- 10_1101-2021_02_13_429885 545 11 see see VB 10_1101-2021_02_13_429885 545 12 also also RB 10_1101-2021_02_13_429885 545 13 Main Main NNP 10_1101-2021_02_13_429885 545 14 Text Text NNP 10_1101-2021_02_13_429885 545 15 Figure figure NN 10_1101-2021_02_13_429885 545 16 4 4 CD 10_1101-2021_02_13_429885 545 18 ) ) -RRB- 10_1101-2021_02_13_429885 545 19 . . . 10_1101-2021_02_13_429885 546 1 a a NNS 10_1101-2021_02_13_429885 546 2 . . . 10_1101-2021_02_13_429885 547 1 Data Data NNP 10_1101-2021_02_13_429885 547 2 for for IN 10_1101-2021_02_13_429885 547 3 the the DT 10_1101-2021_02_13_429885 547 4 sample sample NN 10_1101-2021_02_13_429885 547 5 ( ( -LRB- 10_1101-2021_02_13_429885 547 6 genome genome NN 10_1101-2021_02_13_429885 547 7 - - HYPH 10_1101-2021_02_13_429885 547 8 wide wide JJ 10_1101-2021_02_13_429885 547 9 CNA cna NN 10_1101-2021_02_13_429885 547 10 segments segment NNS 10_1101-2021_02_13_429885 547 11 , , , 10_1101-2021_02_13_429885 547 12 CCF CCF NNP 10_1101-2021_02_13_429885 547 13 and and CC 10_1101-2021_02_13_429885 547 14 read read VBD 10_1101-2021_02_13_429885 547 15 counts count NNS 10_1101-2021_02_13_429885 547 16 distribution distribution NN 10_1101-2021_02_13_429885 547 17 ) ) -RRB- 10_1101-2021_02_13_429885 547 18 . . . 10_1101-2021_02_13_429885 548 1 b b NNP 10_1101-2021_02_13_429885 548 2 , , , 10_1101-2021_02_13_429885 548 3 c c NNP 10_1101-2021_02_13_429885 548 4 . . . 
10_1101-2021_02_13_429885 549 1 Peak peak VB 10_1101-2021_02_13_429885 549 2 analysis analysis NN 10_1101-2021_02_13_429885 549 3 and and CC 10_1101-2021_02_13_429885 549 4 CCF ccf NN 10_1101-2021_02_13_429885 549 5 computation computation NN 10_1101-2021_02_13_429885 549 6 for for IN 10_1101-2021_02_13_429885 549 7 the the DT 10_1101-2021_02_13_429885 549 8 sample sample NN 10_1101-2021_02_13_429885 549 9 . . . 
10_1101-2021_02_13_429885 553 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 553 2 Figure Figure NNP 10_1101-2021_02_13_429885 553 3 S3 S3 NNP 10_1101-2021_02_13_429885 553 4 . . . 
10_1101-2021_02_13_429885 554 1 Colorectal colorectal JJ 10_1101-2021_02_13_429885 554 2 multi multi JJ 10_1101-2021_02_13_429885 554 3 - - JJ 10_1101-2021_02_13_429885 554 4 region region JJ 10_1101-2021_02_13_429885 554 5 sample sample NN 10_1101-2021_02_13_429885 554 6 Set7_59 Set7_59 NNP 10_1101-2021_02_13_429885 554 7 for for IN 10_1101-2021_02_13_429885 554 8 patient patient NN 10_1101-2021_02_13_429885 554 9 Set7 Set7 NNP 10_1101-2021_02_13_429885 554 10 ( ( -LRB- 10_1101-2021_02_13_429885 554 11 see see VB 10_1101-2021_02_13_429885 554 12 also also RB 10_1101-2021_02_13_429885 554 13 Main Main NNP 10_1101-2021_02_13_429885 554 14 Text Text NNP 10_1101-2021_02_13_429885 554 15 Figure figure NN 10_1101-2021_02_13_429885 554 16 4 4 CD 10_1101-2021_02_13_429885 554 18 ) ) -RRB- 10_1101-2021_02_13_429885 554 19 . . . 10_1101-2021_02_13_429885 555 1 a a NNS 10_1101-2021_02_13_429885 555 2 . . . 10_1101-2021_02_13_429885 556 1 Data Data NNP 10_1101-2021_02_13_429885 556 2 for for IN 10_1101-2021_02_13_429885 556 3 the the DT 10_1101-2021_02_13_429885 556 4 sample sample NN 10_1101-2021_02_13_429885 556 5 ( ( -LRB- 10_1101-2021_02_13_429885 556 6 genome genome NN 10_1101-2021_02_13_429885 556 7 - - HYPH 10_1101-2021_02_13_429885 556 8 wide wide JJ 10_1101-2021_02_13_429885 556 9 CNA cna NN 10_1101-2021_02_13_429885 556 10 segments segment NNS 10_1101-2021_02_13_429885 556 11 , , , 10_1101-2021_02_13_429885 556 12 CCF CCF NNP 10_1101-2021_02_13_429885 556 13 and and CC 10_1101-2021_02_13_429885 556 14 read read VBD 10_1101-2021_02_13_429885 556 15 counts count NNS 10_1101-2021_02_13_429885 556 16 distribution distribution NN 10_1101-2021_02_13_429885 556 17 ) ) -RRB- 10_1101-2021_02_13_429885 556 18 . . . 10_1101-2021_02_13_429885 557 1 b b NNP 10_1101-2021_02_13_429885 557 2 , , , 10_1101-2021_02_13_429885 557 3 c c NNP 10_1101-2021_02_13_429885 557 4 . . . 
10_1101-2021_02_13_429885 558 1 Peak peak VB 10_1101-2021_02_13_429885 558 2 analysis analysis NN 10_1101-2021_02_13_429885 558 3 and and CC 10_1101-2021_02_13_429885 558 4 CCF ccf NN 10_1101-2021_02_13_429885 558 5 computation computation NN 10_1101-2021_02_13_429885 558 6 for for IN 10_1101-2021_02_13_429885 558 7 the the DT 10_1101-2021_02_13_429885 558 8 sample sample NN 10_1101-2021_02_13_429885 558 9 . . . 
10_1101-2021_02_13_429885 562 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 562 2 Figure Figure NNP 10_1101-2021_02_13_429885 562 3 S4 s4 NN 10_1101-2021_02_13_429885 562 4 . . . 
10_1101-2021_02_13_429885 563 1 ​Colorectal ​colorectal JJ 10_1101-2021_02_13_429885 563 2 multi multi JJ 10_1101-2021_02_13_429885 563 3 - - JJ 10_1101-2021_02_13_429885 563 4 region region JJ 10_1101-2021_02_13_429885 563 5 sample sample NN 10_1101-2021_02_13_429885 563 6 Set7_62 Set7_62 NNP 10_1101-2021_02_13_429885 563 7 for for IN 10_1101-2021_02_13_429885 563 8 patient patient NN 10_1101-2021_02_13_429885 563 9 Set7 Set7 NNP 10_1101-2021_02_13_429885 563 10 ( ( -LRB- 10_1101-2021_02_13_429885 563 11 see see VB 10_1101-2021_02_13_429885 563 12 also also RB 10_1101-2021_02_13_429885 563 13 Main Main NNP 10_1101-2021_02_13_429885 563 14 Text Text NNP 10_1101-2021_02_13_429885 563 15 ​Figure ​figure NN 10_1101-2021_02_13_429885 563 16 4 4 CD 10_1101-2021_02_13_429885 563 17 ​ ​ UH 10_1101-2021_02_13_429885 563 18 ) ) -RRB- 10_1101-2021_02_13_429885 563 19 . . . 10_1101-2021_02_13_429885 564 1 ​a ​a NNS 10_1101-2021_02_13_429885 564 2 . . . 10_1101-2021_02_13_429885 565 1 ​Data ​Data NNP 10_1101-2021_02_13_429885 565 2 for for IN 10_1101-2021_02_13_429885 565 3 the the DT 10_1101-2021_02_13_429885 565 4 sample sample NN 10_1101-2021_02_13_429885 565 5 ( ( -LRB- 10_1101-2021_02_13_429885 565 6 genome genome NN 10_1101-2021_02_13_429885 565 7 - - HYPH 10_1101-2021_02_13_429885 565 8 wide wide JJ 10_1101-2021_02_13_429885 565 9 CNA cna NN 10_1101-2021_02_13_429885 565 10 segments segment NNS 10_1101-2021_02_13_429885 565 11 , , , 10_1101-2021_02_13_429885 565 12 CCF CCF NNP 10_1101-2021_02_13_429885 565 13 and and CC 10_1101-2021_02_13_429885 565 14 read read VBD 10_1101-2021_02_13_429885 565 15 counts count NNS 10_1101-2021_02_13_429885 565 16 distribution distribution NN 10_1101-2021_02_13_429885 565 17 ) ) -RRB- 10_1101-2021_02_13_429885 565 18 . . . 
10_1101-2021_02_13_429885 566 1 ​b ​b NNP 10_1101-2021_02_13_429885 566 2 , , , 10_1101-2021_02_13_429885 566 3 c.​ c.​ NNP 10_1101-2021_02_13_429885 566 4 Peak Peak NNP 10_1101-2021_02_13_429885 566 5 analysis analysis NN 10_1101-2021_02_13_429885 566 6 and and CC 10_1101-2021_02_13_429885 566 7 CCF ccf NN 10_1101-2021_02_13_429885 566 8 computation computation NN 10_1101-2021_02_13_429885 566 9 for for IN 10_1101-2021_02_13_429885 566 10 the the DT 10_1101-2021_02_13_429885 566 11 sample sample NN 10_1101-2021_02_13_429885 566 12 . . . 
10_1101-2021_02_13_429885 570 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 570 2 Figure Figure NNP 10_1101-2021_02_13_429885 570 3 S5 s5 NN 10_1101-2021_02_13_429885 570 4 . . . 10_1101-2021_02_13_429885 571 1 ​Colorectal ​colorectal JJ 10_1101-2021_02_13_429885 571 2 multi multi JJ 10_1101-2021_02_13_429885 571 3 - - JJ 10_1101-2021_02_13_429885 571 4 region region JJ 10_1101-2021_02_13_429885 571 5 sample sample NN 10_1101-2021_02_13_429885 571 6 Set6_42 Set6_42 NNP 10_1101-2021_02_13_429885 571 7 for for IN 10_1101-2021_02_13_429885 571 8 patient patient NN 10_1101-2021_02_13_429885 571 9 Set6 Set6 NNP 10_1101-2021_02_13_429885 571 10 . . . 10_1101-2021_02_13_429885 572 1 ​a ​a NNS 10_1101-2021_02_13_429885 572 2 . . . 
10_1101-2021_02_13_429885 573 1 Data datum NNS 10_1101-2021_02_13_429885 573 2 for for IN 10_1101-2021_02_13_429885 573 3 the the DT 10_1101-2021_02_13_429885 573 4 sample sample NN 10_1101-2021_02_13_429885 573 5 ( ( -LRB- 10_1101-2021_02_13_429885 573 6 genome genome NN 10_1101-2021_02_13_429885 573 7 - - HYPH 10_1101-2021_02_13_429885 573 8 wide wide JJ 10_1101-2021_02_13_429885 573 9 CNA cna NN 10_1101-2021_02_13_429885 573 10 segments segment NNS 10_1101-2021_02_13_429885 573 11 , , , 10_1101-2021_02_13_429885 573 12 CCF CCF NNP 10_1101-2021_02_13_429885 573 13 and and CC 10_1101-2021_02_13_429885 573 14 read read VBD 10_1101-2021_02_13_429885 573 15 counts count NNS 10_1101-2021_02_13_429885 573 16 distribution distribution NN 10_1101-2021_02_13_429885 573 17 ) ) -RRB- 10_1101-2021_02_13_429885 573 18 . . . 10_1101-2021_02_13_429885 574 1 b b LS 10_1101-2021_02_13_429885 574 2 , , , 10_1101-2021_02_13_429885 574 3 c.​ c.​ NNP 10_1101-2021_02_13_429885 574 4 Peak Peak NNP 10_1101-2021_02_13_429885 574 5 analysis analysis NN 10_1101-2021_02_13_429885 574 6 and and CC 10_1101-2021_02_13_429885 574 7 CCF ccf NN 10_1101-2021_02_13_429885 574 8 computation computation NN 10_1101-2021_02_13_429885 574 9 for for IN 10_1101-2021_02_13_429885 574 10 the the DT 10_1101-2021_02_13_429885 574 11 sample sample NN 10_1101-2021_02_13_429885 574 12 . . . 
10_1101-2021_02_13_429885 578 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 578 2 Figure Figure NNP 10_1101-2021_02_13_429885 578 3 S6 s6 NN 10_1101-2021_02_13_429885 578 4 . . . 10_1101-2021_02_13_429885 579 1 ​Colorectal ​colorectal JJ 10_1101-2021_02_13_429885 579 2 multi multi JJ 10_1101-2021_02_13_429885 579 3 - - JJ 10_1101-2021_02_13_429885 579 4 region region JJ 10_1101-2021_02_13_429885 579 5 sample sample NN 10_1101-2021_02_13_429885 579 6 Set6_44 Set6_44 NNP 10_1101-2021_02_13_429885 579 7 for for IN 10_1101-2021_02_13_429885 579 8 patient patient NN 10_1101-2021_02_13_429885 579 9 Set6 Set6 NNP 10_1101-2021_02_13_429885 579 10 . . . 10_1101-2021_02_13_429885 580 1 ​a ​a NNS 10_1101-2021_02_13_429885 580 2 . . . 
10_1101-2021_02_13_429885 581 1 Data datum NNS 10_1101-2021_02_13_429885 581 2 for for IN 10_1101-2021_02_13_429885 581 3 the the DT 10_1101-2021_02_13_429885 581 4 sample sample NN 10_1101-2021_02_13_429885 581 5 ( ( -LRB- 10_1101-2021_02_13_429885 581 6 genome genome NN 10_1101-2021_02_13_429885 581 7 - - HYPH 10_1101-2021_02_13_429885 581 8 wide wide JJ 10_1101-2021_02_13_429885 581 9 CNA cna NN 10_1101-2021_02_13_429885 581 10 segments segment NNS 10_1101-2021_02_13_429885 581 11 , , , 10_1101-2021_02_13_429885 581 12 CCF CCF NNP 10_1101-2021_02_13_429885 581 13 and and CC 10_1101-2021_02_13_429885 581 14 read read VBD 10_1101-2021_02_13_429885 581 15 counts count NNS 10_1101-2021_02_13_429885 581 16 distribution distribution NN 10_1101-2021_02_13_429885 581 17 ) ) -RRB- 10_1101-2021_02_13_429885 581 18 . . . 10_1101-2021_02_13_429885 582 1 b b LS 10_1101-2021_02_13_429885 582 2 , , , 10_1101-2021_02_13_429885 582 3 c.​ c.​ NNP 10_1101-2021_02_13_429885 582 4 Peak Peak NNP 10_1101-2021_02_13_429885 582 5 analysis analysis NN 10_1101-2021_02_13_429885 582 6 and and CC 10_1101-2021_02_13_429885 582 7 CCF ccf NN 10_1101-2021_02_13_429885 582 8 computation computation NN 10_1101-2021_02_13_429885 582 9 for for IN 10_1101-2021_02_13_429885 582 10 the the DT 10_1101-2021_02_13_429885 582 11 sample sample NN 10_1101-2021_02_13_429885 582 12 . . . 
10_1101-2021_02_13_429885 586 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 586 2 Figure Figure NNP 10_1101-2021_02_13_429885 586 3 S7 S7 NNP 10_1101-2021_02_13_429885 586 4 . . . 10_1101-2021_02_13_429885 587 1 ​Colorectal ​colorectal JJ 10_1101-2021_02_13_429885 587 2 multi multi JJ 10_1101-2021_02_13_429885 587 3 - - JJ 10_1101-2021_02_13_429885 587 4 region region JJ 10_1101-2021_02_13_429885 587 5 sample sample NN 10_1101-2021_02_13_429885 587 6 Set6_45 Set6_45 NNP 10_1101-2021_02_13_429885 587 7 for for IN 10_1101-2021_02_13_429885 587 8 patient patient NN 10_1101-2021_02_13_429885 587 9 Set6 Set6 NNP 10_1101-2021_02_13_429885 587 10 . . . 10_1101-2021_02_13_429885 588 1 ​a ​a NNS 10_1101-2021_02_13_429885 588 2 . . . 
10_1101-2021_02_13_429885 589 1 Data datum NNS 10_1101-2021_02_13_429885 589 2 for for IN 10_1101-2021_02_13_429885 589 3 the the DT 10_1101-2021_02_13_429885 589 4 sample sample NN 10_1101-2021_02_13_429885 589 5 ( ( -LRB- 10_1101-2021_02_13_429885 589 6 genome genome NN 10_1101-2021_02_13_429885 589 7 - - HYPH 10_1101-2021_02_13_429885 589 8 wide wide JJ 10_1101-2021_02_13_429885 589 9 CNA cna NN 10_1101-2021_02_13_429885 589 10 segments segment NNS 10_1101-2021_02_13_429885 589 11 , , , 10_1101-2021_02_13_429885 589 12 CCF CCF NNP 10_1101-2021_02_13_429885 589 13 and and CC 10_1101-2021_02_13_429885 589 14 read read VBD 10_1101-2021_02_13_429885 589 15 counts count NNS 10_1101-2021_02_13_429885 589 16 distribution distribution NN 10_1101-2021_02_13_429885 589 17 ) ) -RRB- 10_1101-2021_02_13_429885 589 18 . . . 10_1101-2021_02_13_429885 590 1 b b LS 10_1101-2021_02_13_429885 590 2 , , , 10_1101-2021_02_13_429885 590 3 c.​ c.​ NNP 10_1101-2021_02_13_429885 590 4 Peak Peak NNP 10_1101-2021_02_13_429885 590 5 analysis analysis NN 10_1101-2021_02_13_429885 590 6 and and CC 10_1101-2021_02_13_429885 590 7 CCF ccf NN 10_1101-2021_02_13_429885 590 8 computation computation NN 10_1101-2021_02_13_429885 590 9 for for IN 10_1101-2021_02_13_429885 590 10 the the DT 10_1101-2021_02_13_429885 590 11 sample sample NN 10_1101-2021_02_13_429885 590 12 . . . 
10_1101-2021_02_13_429885 594 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 594 2 Figure Figure NNP 10_1101-2021_02_13_429885 594 3 S8 S8 NNP 10_1101-2021_02_13_429885 594 4 . . . 10_1101-2021_02_13_429885 595 1 ​Colorectal ​colorectal JJ 10_1101-2021_02_13_429885 595 2 multi multi JJ 10_1101-2021_02_13_429885 595 3 - - JJ 10_1101-2021_02_13_429885 595 4 region region JJ 10_1101-2021_02_13_429885 595 5 sample sample NN 10_1101-2021_02_13_429885 595 6 Set6_46 Set6_46 NNP 10_1101-2021_02_13_429885 595 7 for for IN 10_1101-2021_02_13_429885 595 8 patient patient NN 10_1101-2021_02_13_429885 595 9 Set6 Set6 NNP 10_1101-2021_02_13_429885 595 10 . . . 10_1101-2021_02_13_429885 596 1 ​a ​a NNS 10_1101-2021_02_13_429885 596 2 . . . 
10_1101-2021_02_13_429885 597 1 Data datum NNS 10_1101-2021_02_13_429885 597 2 for for IN 10_1101-2021_02_13_429885 597 3 the the DT 10_1101-2021_02_13_429885 597 4 sample sample NN 10_1101-2021_02_13_429885 597 5 ( ( -LRB- 10_1101-2021_02_13_429885 597 6 genome genome NN 10_1101-2021_02_13_429885 597 7 - - HYPH 10_1101-2021_02_13_429885 597 8 wide wide JJ 10_1101-2021_02_13_429885 597 9 CNA cna NN 10_1101-2021_02_13_429885 597 10 segments segment NNS 10_1101-2021_02_13_429885 597 11 , , , 10_1101-2021_02_13_429885 597 12 CCF CCF NNP 10_1101-2021_02_13_429885 597 13 and and CC 10_1101-2021_02_13_429885 597 14 read read VBD 10_1101-2021_02_13_429885 597 15 counts count NNS 10_1101-2021_02_13_429885 597 16 distribution distribution NN 10_1101-2021_02_13_429885 597 17 ) ) -RRB- 10_1101-2021_02_13_429885 597 18 . . . 10_1101-2021_02_13_429885 598 1 b b LS 10_1101-2021_02_13_429885 598 2 , , , 10_1101-2021_02_13_429885 598 3 c.​ c.​ NNP 10_1101-2021_02_13_429885 598 4 Peak Peak NNP 10_1101-2021_02_13_429885 598 5 analysis analysis NN 10_1101-2021_02_13_429885 598 6 and and CC 10_1101-2021_02_13_429885 598 7 CCF ccf NN 10_1101-2021_02_13_429885 598 8 computation computation NN 10_1101-2021_02_13_429885 598 9 for for IN 10_1101-2021_02_13_429885 598 10 the the DT 10_1101-2021_02_13_429885 598 11 sample sample NN 10_1101-2021_02_13_429885 598 12 . . . 
10_1101-2021_02_13_429885 602 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 602 2 Figure Figure NNP 10_1101-2021_02_13_429885 602 3 S9 s9 NN 10_1101-2021_02_13_429885 602 4 . . . 10_1101-2021_02_13_429885 603 1 ​Colorectal ​colorectal JJ 10_1101-2021_02_13_429885 603 2 multi multi JJ 10_1101-2021_02_13_429885 603 3 - - JJ 10_1101-2021_02_13_429885 603 4 region region JJ 10_1101-2021_02_13_429885 603 5 sample sample NN 10_1101-2021_02_13_429885 603 6 Set6_47 Set6_47 NNP 10_1101-2021_02_13_429885 603 7 for for IN 10_1101-2021_02_13_429885 603 8 patient patient NN 10_1101-2021_02_13_429885 603 9 Set6 Set6 NNP 10_1101-2021_02_13_429885 603 10 . . . 10_1101-2021_02_13_429885 604 1 ​a ​a NNS 10_1101-2021_02_13_429885 604 2 . . . 
10_1101-2021_02_13_429885 605 1 Data datum NNS 10_1101-2021_02_13_429885 605 2 for for IN 10_1101-2021_02_13_429885 605 3 the the DT 10_1101-2021_02_13_429885 605 4 sample sample NN 10_1101-2021_02_13_429885 605 5 ( ( -LRB- 10_1101-2021_02_13_429885 605 6 genome genome NN 10_1101-2021_02_13_429885 605 7 - - HYPH 10_1101-2021_02_13_429885 605 8 wide wide JJ 10_1101-2021_02_13_429885 605 9 CNA cna NN 10_1101-2021_02_13_429885 605 10 segments segment NNS 10_1101-2021_02_13_429885 605 11 , , , 10_1101-2021_02_13_429885 605 12 CCF CCF NNP 10_1101-2021_02_13_429885 605 13 and and CC 10_1101-2021_02_13_429885 605 14 read read VBD 10_1101-2021_02_13_429885 605 15 counts count NNS 10_1101-2021_02_13_429885 605 16 distribution distribution NN 10_1101-2021_02_13_429885 605 17 ) ) -RRB- 10_1101-2021_02_13_429885 605 18 . . . 10_1101-2021_02_13_429885 606 1 b b LS 10_1101-2021_02_13_429885 606 2 , , , 10_1101-2021_02_13_429885 606 3 c.​ c.​ NNP 10_1101-2021_02_13_429885 606 4 Peak Peak NNP 10_1101-2021_02_13_429885 606 5 analysis analysis NN 10_1101-2021_02_13_429885 606 6 and and CC 10_1101-2021_02_13_429885 606 7 CCF ccf NN 10_1101-2021_02_13_429885 606 8 computation computation NN 10_1101-2021_02_13_429885 606 9 for for IN 10_1101-2021_02_13_429885 606 10 the the DT 10_1101-2021_02_13_429885 606 11 sample sample NN 10_1101-2021_02_13_429885 606 12 . . . 
10_1101-2021_02_13_429885 610 1 Supplementary supplementary JJ 10_1101-2021_02_13_429885 610 2 Figure Figure NNP 10_1101-2021_02_13_429885 610 3 S10 S10 NNP 10_1101-2021_02_13_429885 610 4 . . . 10_1101-2021_02_13_429885 611 1 ​Colorectal ​colorectal JJ 10_1101-2021_02_13_429885 611 2 multi multi JJ 10_1101-2021_02_13_429885 611 3 - - JJ 10_1101-2021_02_13_429885 611 4 region region JJ 10_1101-2021_02_13_429885 611 5 sample sample NN 10_1101-2021_02_13_429885 611 6 Set6_48 Set6_48 NNP 10_1101-2021_02_13_429885 611 7 for for IN 10_1101-2021_02_13_429885 611 8 patient patient NN 10_1101-2021_02_13_429885 611 9 Set6 Set6 NNP 10_1101-2021_02_13_429885 611 10 . . . 10_1101-2021_02_13_429885 612 1 a. a. 
NN 10_1101-2021_02_13_429885 613 1 ​Data ​Data NNP 10_1101-2021_02_13_429885 613 2 for for IN 10_1101-2021_02_13_429885 613 3 the the DT 10_1101-2021_02_13_429885 613 4 sample sample NN 10_1101-2021_02_13_429885 613 5 ( ( -LRB- 10_1101-2021_02_13_429885 613 6 genome genome NN 10_1101-2021_02_13_429885 613 7 - - HYPH 10_1101-2021_02_13_429885 613 8 wide wide JJ 10_1101-2021_02_13_429885 613 9 CNA cna NN 10_1101-2021_02_13_429885 613 10 segments segment NNS 10_1101-2021_02_13_429885 613 11 , , , 10_1101-2021_02_13_429885 613 12 CCF CCF NNP 10_1101-2021_02_13_429885 613 13 and and CC 10_1101-2021_02_13_429885 613 14 read read VBD 10_1101-2021_02_13_429885 613 15 counts count NNS 10_1101-2021_02_13_429885 613 16 distribution distribution NN 10_1101-2021_02_13_429885 613 17 ) ) -RRB- 10_1101-2021_02_13_429885 613 18 . . . 10_1101-2021_02_13_429885 614 1 ​b ​b NNP 10_1101-2021_02_13_429885 614 2 , , , 10_1101-2021_02_13_429885 614 3 c.​ c.​ NNP 10_1101-2021_02_13_429885 614 4 Peak Peak NNP 10_1101-2021_02_13_429885 614 5 analysis analysis NN 10_1101-2021_02_13_429885 614 6 and and CC 10_1101-2021_02_13_429885 614 7 CCF ccf NN 10_1101-2021_02_13_429885 614 8 computation computation NN 10_1101-2021_02_13_429885 614 9 for for IN 10_1101-2021_02_13_429885 614 10 the the DT 10_1101-2021_02_13_429885 614 11 sample sample NN 10_1101-2021_02_13_429885 614 12 . . . 
10_1101-2021_02_13_429885 618 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 618 2 Figure Figure NNP 10_1101-2021_02_13_429885 618 3 11 11 CD 10_1101-2021_02_13_429885 618 4 . . . 10_1101-2021_02_13_429885 619 1 ​Example ​Example NNP 10_1101-2021_02_13_429885 619 2 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 619 3 sample sample NN 10_1101-2021_02_13_429885 619 4 with with IN 10_1101-2021_02_13_429885 619 5 purity purity NN 10_1101-2021_02_13_429885 619 6 of of IN 10_1101-2021_02_13_429885 619 7 100 100 CD 10_1101-2021_02_13_429885 619 8 % % NN 10_1101-2021_02_13_429885 619 9 . . . 10_1101-2021_02_13_429885 620 1 ​a ​a NNS 10_1101-2021_02_13_429885 620 2 . . . 
10_1101-2021_02_13_429885 621 1 ​Data ​Data NNP 10_1101-2021_02_13_429885 621 2 for for IN 10_1101-2021_02_13_429885 621 3 the the DT 10_1101-2021_02_13_429885 621 4 sample sample NN 10_1101-2021_02_13_429885 621 5 ( ( -LRB- 10_1101-2021_02_13_429885 621 6 genome genome NN 10_1101-2021_02_13_429885 621 7 - - HYPH 10_1101-2021_02_13_429885 621 8 wide wide JJ 10_1101-2021_02_13_429885 621 9 CNA cna NN 10_1101-2021_02_13_429885 621 10 segments segment NNS 10_1101-2021_02_13_429885 621 11 , , , 10_1101-2021_02_13_429885 621 12 CCF CCF NNP 10_1101-2021_02_13_429885 621 13 and and CC 10_1101-2021_02_13_429885 621 14 read read VBD 10_1101-2021_02_13_429885 621 15 counts count NNS 10_1101-2021_02_13_429885 621 16 distribution distribution NN 10_1101-2021_02_13_429885 621 17 ) ) -RRB- 10_1101-2021_02_13_429885 621 18 . . . 10_1101-2021_02_13_429885 622 1 ​b ​b NNP 10_1101-2021_02_13_429885 622 2 . . . 10_1101-2021_02_13_429885 623 1 This this DT 10_1101-2021_02_13_429885 623 2 sample sample NN 10_1101-2021_02_13_429885 623 3 has have VBZ 10_1101-2021_02_13_429885 623 4 75 75 CD 10_1101-2021_02_13_429885 623 5 % % NN 10_1101-2021_02_13_429885 623 6 of of IN 10_1101-2021_02_13_429885 623 7 its -PRON- PRP$ 10_1101-2021_02_13_429885 623 8 SNVs SNVs NNP 10_1101-2021_02_13_429885 623 9 in in IN 10_1101-2021_02_13_429885 623 10 diploid diploid JJ 10_1101-2021_02_13_429885 623 11 tumour tumour NN 10_1101-2021_02_13_429885 623 12 regions region NNS 10_1101-2021_02_13_429885 623 13 , , , 10_1101-2021_02_13_429885 623 14 where where WRB 10_1101-2021_02_13_429885 623 15 a a DT 10_1101-2021_02_13_429885 623 16 small small JJ 10_1101-2021_02_13_429885 623 17 peak peak NN 10_1101-2021_02_13_429885 623 18 is be VBZ 10_1101-2021_02_13_429885 623 19 detectable detectable JJ 10_1101-2021_02_13_429885 623 20 at at IN 10_1101-2021_02_13_429885 623 21 the the DT 10_1101-2021_02_13_429885 623 22 expected expected JJ 10_1101-2021_02_13_429885 623 23 purity purity NN 10_1101-2021_02_13_429885 623 
24 . . . 10_1101-2021_02_13_429885 624 1 The the DT 10_1101-2021_02_13_429885 624 2 VAF VAF NNP 10_1101-2021_02_13_429885 624 3 clearly clearly RB 10_1101-2021_02_13_429885 624 4 peaks peak VBZ 10_1101-2021_02_13_429885 624 5 at at IN 10_1101-2021_02_13_429885 624 6 ~10 ~10 NNP 10_1101-2021_02_13_429885 624 7 % % NN 10_1101-2021_02_13_429885 624 8 , , , 10_1101-2021_02_13_429885 624 9 possibly possibly RB 10_1101-2021_02_13_429885 624 10 suggesting suggest VBG 10_1101-2021_02_13_429885 624 11 a a DT 10_1101-2021_02_13_429885 624 12 purity purity NN 10_1101-2021_02_13_429885 624 13 of of IN 10_1101-2021_02_13_429885 624 14 20 20 CD 10_1101-2021_02_13_429885 624 15 % % NN 10_1101-2021_02_13_429885 624 16 or or CC 10_1101-2021_02_13_429885 624 17 lower low JJR 10_1101-2021_02_13_429885 624 18 , , , 10_1101-2021_02_13_429885 624 19 rather rather RB 10_1101-2021_02_13_429885 624 20 than than IN 10_1101-2021_02_13_429885 624 21 100 100 CD 10_1101-2021_02_13_429885 624 22 % % NN 10_1101-2021_02_13_429885 624 23 . . . 
10_1101-2021_02_13_429885 625 1 Further further JJ 10_1101-2021_02_13_429885 625 2 doubts doubt NNS 10_1101-2021_02_13_429885 625 3 about about IN 10_1101-2021_02_13_429885 625 4 the the DT 10_1101-2021_02_13_429885 625 5 current current JJ 10_1101-2021_02_13_429885 625 6 purity purity NN 10_1101-2021_02_13_429885 625 7 come come VBP 10_1101-2021_02_13_429885 625 8 from from IN 10_1101-2021_02_13_429885 625 9 non non JJ 10_1101-2021_02_13_429885 625 10 - - JJ 10_1101-2021_02_13_429885 625 11 diploid diploid JJ 10_1101-2021_02_13_429885 625 12 regions region NNS 10_1101-2021_02_13_429885 625 13 , , , 10_1101-2021_02_13_429885 625 14 where where WRB 10_1101-2021_02_13_429885 625 15 all all DT 10_1101-2021_02_13_429885 625 16 peaks peak NNS 10_1101-2021_02_13_429885 625 17 are be VBP 10_1101-2021_02_13_429885 625 18 mismatched mismatch VBN 10_1101-2021_02_13_429885 625 19 ; ; : 10_1101-2021_02_13_429885 625 20 for for IN 10_1101-2021_02_13_429885 625 21 this this DT 10_1101-2021_02_13_429885 625 22 sample sample NN 10_1101-2021_02_13_429885 625 23 CNAs cna NNS 10_1101-2021_02_13_429885 625 24 called call VBD 10_1101-2021_02_13_429885 625 25 with with IN 10_1101-2021_02_13_429885 625 26 a a DT 10_1101-2021_02_13_429885 625 27 low low JJ 10_1101-2021_02_13_429885 625 28 - - HYPH 10_1101-2021_02_13_429885 625 29 purity purity NN 10_1101-2021_02_13_429885 625 30 solution solution NN 10_1101-2021_02_13_429885 625 31 should should MD 10_1101-2021_02_13_429885 625 32 be be VB 10_1101-2021_02_13_429885 625 33 compared compare VBN 10_1101-2021_02_13_429885 625 34 to to IN 10_1101-2021_02_13_429885 625 35 the the DT 10_1101-2021_02_13_429885 625 36 100 100 CD 10_1101-2021_02_13_429885 625 37 % % NN 10_1101-2021_02_13_429885 625 38 purity purity NN 10_1101-2021_02_13_429885 625 39 solution solution NN 10_1101-2021_02_13_429885 625 40 . . . 10_1101-2021_02_13_429885 626 1 ​c ​c NNP 10_1101-2021_02_13_429885 626 2 . . . 
10_1101-2021_02_13_429885 627 1 CCF ccf NN 10_1101-2021_02_13_429885 627 2 computation computation NN 10_1101-2021_02_13_429885 627 3 for for IN 10_1101-2021_02_13_429885 627 4 the the DT 10_1101-2021_02_13_429885 627 5 sample sample NN 10_1101-2021_02_13_429885 627 6 . . . 10_1101-2021_02_13_429885 628 1 Notice notice VB 10_1101-2021_02_13_429885 628 2 that that IN 10_1101-2021_02_13_429885 628 3 in in IN 10_1101-2021_02_13_429885 628 4 triploid triploid JJ 10_1101-2021_02_13_429885 628 5 and and CC 10_1101-2021_02_13_429885 628 6 tetraploid tetraploid JJ 10_1101-2021_02_13_429885 628 7 tumour tumour NN 10_1101-2021_02_13_429885 628 8 genomes genome NNS 10_1101-2021_02_13_429885 628 9 we -PRON- PRP 10_1101-2021_02_13_429885 628 10 do do VBP 10_1101-2021_02_13_429885 628 11 not not RB 10_1101-2021_02_13_429885 628 12 find find VB 10_1101-2021_02_13_429885 628 13 mutations mutation NNS 10_1101-2021_02_13_429885 628 14 present present JJ 10_1101-2021_02_13_429885 628 15 in in IN 10_1101-2021_02_13_429885 628 16 2 2 CD 10_1101-2021_02_13_429885 628 17 copies copy NNS 10_1101-2021_02_13_429885 628 18 . . . 10_1101-2021_02_13_429885 629 1 Was be VBD 10_1101-2021_02_13_429885 629 2 this this DT 10_1101-2021_02_13_429885 629 3 true true JJ 10_1101-2021_02_13_429885 629 4 then then RB 10_1101-2021_02_13_429885 629 5 the the DT 10_1101-2021_02_13_429885 629 6 tumour tumour NN 10_1101-2021_02_13_429885 629 7 did do VBD 10_1101-2021_02_13_429885 629 8 not not RB 10_1101-2021_02_13_429885 629 9 acquire acquire VB 10_1101-2021_02_13_429885 629 10 any any DT 10_1101-2021_02_13_429885 629 11 SNV SNV NNP 10_1101-2021_02_13_429885 629 12 right right RB 10_1101-2021_02_13_429885 629 13 before before IN 10_1101-2021_02_13_429885 629 14 the the DT 10_1101-2021_02_13_429885 629 15 CNA CNA NNP 10_1101-2021_02_13_429885 629 16 . . . 
10_1101-2021_02_13_429885 630 1 Also also RB 10_1101-2021_02_13_429885 630 2 , , , 10_1101-2021_02_13_429885 630 3 here here RB 10_1101-2021_02_13_429885 630 4 we -PRON- PRP 10_1101-2021_02_13_429885 630 5 are be VBP 10_1101-2021_02_13_429885 630 6 not not RB 10_1101-2021_02_13_429885 630 7 cross cross JJ 10_1101-2021_02_13_429885 630 8 - - JJ 10_1101-2021_02_13_429885 630 9 checking checking JJ 10_1101-2021_02_13_429885 630 10 QC QC NNP 10_1101-2021_02_13_429885 630 11 results result NNS 10_1101-2021_02_13_429885 630 12 from from IN 10_1101-2021_02_13_429885 630 13 peak peak NN 10_1101-2021_02_13_429885 633 1 detection detection NN 10_1101-2021_02_13_429885 633 2 ; ; : 10_1101-2021_02_13_429885 633 3 for for IN 10_1101-2021_02_13_429885 633 4 instance instance NN 10_1101-2021_02_13_429885 633 5 we -PRON- PRP 10_1101-2021_02_13_429885 633 6 could could MD 10_1101-2021_02_13_429885 633 7 decide decide VB 10_1101-2021_02_13_429885 633 8 to to TO 10_1101-2021_02_13_429885 633 9 use use VB 10_1101-2021_02_13_429885 633 10 only only JJ 10_1101-2021_02_13_429885 633 11 mutations mutation NNS 10_1101-2021_02_13_429885 633 12 that that WDT 10_1101-2021_02_13_429885 633 13 map map VBP 10_1101-2021_02_13_429885 633 14 to to IN 10_1101-2021_02_13_429885 633 15 PASS PASS NNP 10_1101-2021_02_13_429885 633 16 states state NNS 10_1101-2021_02_13_429885 633 17 ( ( -LRB- 10_1101-2021_02_13_429885 633 18 1:1 1:1 CD 10_1101-2021_02_13_429885 633 19 , , , 10_1101-2021_02_13_429885 633 20 2:2 2:2 CD 10_1101-2021_02_13_429885 633 21 ) ) -RRB- 10_1101-2021_02_13_429885 633 22 , , , 
10_1101-2021_02_13_429885 633 23 and and CC 10_1101-2021_02_13_429885 633 24 reject reject VBP 10_1101-2021_02_13_429885 633 25 all all DT 10_1101-2021_02_13_429885 633 26 others other NNS 10_1101-2021_02_13_429885 633 27 . . . 10_1101-2021_02_13_429885 634 1 Supplementary Supplementary NNP 10_1101-2021_02_13_429885 634 2 Figure Figure NNP 10_1101-2021_02_13_429885 634 3 12 12 CD 10_1101-2021_02_13_429885 634 4 . . . 10_1101-2021_02_13_429885 635 1 ​Example ​Example NNP 10_1101-2021_02_13_429885 635 2 PCAWG PCAWG NNP 10_1101-2021_02_13_429885 635 3 pancreatic pancreatic JJ 10_1101-2021_02_13_429885 635 4 adenocarcinoma adenocarcinoma NN 10_1101-2021_02_13_429885 635 5 with with IN 10_1101-2021_02_13_429885 635 6 99 99 CD 10_1101-2021_02_13_429885 635 7 % % NN 10_1101-2021_02_13_429885 635 8 purity purity NN 10_1101-2021_02_13_429885 635 9 ( ( -LRB- 10_1101-2021_02_13_429885 635 10 and and CC 10_1101-2021_02_13_429885 635 11 3 3 CD 10_1101-2021_02_13_429885 635 12 possible possible JJ 10_1101-2021_02_13_429885 635 13 driver driver NN 10_1101-2021_02_13_429885 635 14 SNVs snv NNS 10_1101-2021_02_13_429885 635 15 , , , 10_1101-2021_02_13_429885 635 16 2 2 CD 10_1101-2021_02_13_429885 635 17 of of IN 10_1101-2021_02_13_429885 635 18 them -PRON- PRP 10_1101-2021_02_13_429885 635 19 involving involve VBG 10_1101-2021_02_13_429885 635 20 tumour tumour NN 10_1101-2021_02_13_429885 635 21 suppressor suppressor NN 10_1101-2021_02_13_429885 635 22 genes gene NNS 10_1101-2021_02_13_429885 635 23 in in IN 10_1101-2021_02_13_429885 635 24 LOH LOH NNP 10_1101-2021_02_13_429885 635 25 regions region NNS 10_1101-2021_02_13_429885 635 26 ) ) -RRB- 10_1101-2021_02_13_429885 635 27 . . . 10_1101-2021_02_13_429885 636 1 ​a ​a NNS 10_1101-2021_02_13_429885 636 2 . . . 
10_1101-2021_02_13_429885 637 1 ​Data ​Data NNP 10_1101-2021_02_13_429885 637 2 for for IN 10_1101-2021_02_13_429885 637 3 the the DT 10_1101-2021_02_13_429885 637 4 sample sample NN 10_1101-2021_02_13_429885 637 5 ( ( -LRB- 10_1101-2021_02_13_429885 637 6 genome genome NN 10_1101-2021_02_13_429885 637 7 - - HYPH 10_1101-2021_02_13_429885 637 8 wide wide JJ 10_1101-2021_02_13_429885 637 9 CNA cna NN 10_1101-2021_02_13_429885 637 10 segments segment NNS 10_1101-2021_02_13_429885 637 11 , , , 10_1101-2021_02_13_429885 637 12 CCF CCF NNP 10_1101-2021_02_13_429885 637 13 and and CC 10_1101-2021_02_13_429885 637 14 read read VBD 10_1101-2021_02_13_429885 637 15 counts count NNS 10_1101-2021_02_13_429885 637 16 distribution distribution NN 10_1101-2021_02_13_429885 637 17 ) ) -RRB- 10_1101-2021_02_13_429885 637 18 . . . 10_1101-2021_02_13_429885 638 1 ​b ​b NNP 10_1101-2021_02_13_429885 638 2 . . . 10_1101-2021_02_13_429885 639 1 This this DT 10_1101-2021_02_13_429885 639 2 sample sample NN 10_1101-2021_02_13_429885 639 3 has have VBZ 10_1101-2021_02_13_429885 639 4 90 90 CD 10_1101-2021_02_13_429885 639 5 % % NN 10_1101-2021_02_13_429885 639 6 of of IN 10_1101-2021_02_13_429885 639 7 its -PRON- PRP$ 10_1101-2021_02_13_429885 639 8 SNVs SNVs NNP 10_1101-2021_02_13_429885 639 9 in in IN 10_1101-2021_02_13_429885 639 10 diploid diploid JJ 10_1101-2021_02_13_429885 639 11 tumour tumour NN 10_1101-2021_02_13_429885 639 12 regions region NNS 10_1101-2021_02_13_429885 639 13 , , , 10_1101-2021_02_13_429885 639 14 and and CC 10_1101-2021_02_13_429885 639 15 the the DT 10_1101-2021_02_13_429885 639 16 others other NNS 10_1101-2021_02_13_429885 639 17 in in IN 10_1101-2021_02_13_429885 639 18 a a DT 10_1101-2021_02_13_429885 639 19 variety variety NN 10_1101-2021_02_13_429885 639 20 of of IN 10_1101-2021_02_13_429885 639 21 distinct distinct JJ 10_1101-2021_02_13_429885 639 22 CNA CNA NNP 10_1101-2021_02_13_429885 639 23 segments segment NNS 10_1101-2021_02_13_429885 639 24 . . . 
10_1101-2021_02_13_429885 640 1 From from IN 10_1101-2021_02_13_429885 640 2 a a DT 10_1101-2021_02_13_429885 640 3 peak peak NN 10_1101-2021_02_13_429885 640 4 analysis analysis NN 10_1101-2021_02_13_429885 640 5 point point NN 10_1101-2021_02_13_429885 640 6 of of IN 10_1101-2021_02_13_429885 640 7 view view NN 10_1101-2021_02_13_429885 640 8 , , , 10_1101-2021_02_13_429885 640 9 all all PDT 10_1101-2021_02_13_429885 640 10 the the DT 10_1101-2021_02_13_429885 640 11 calls call NNS 10_1101-2021_02_13_429885 640 12 are be VBP 10_1101-2021_02_13_429885 640 13 validated validate VBN 10_1101-2021_02_13_429885 640 14 . . . 10_1101-2021_02_13_429885 641 1 ​c.​ ​c.​ NNP 10_1101-2021_02_13_429885 641 2 CCF ccf NN 10_1101-2021_02_13_429885 641 3 values value NNS 10_1101-2021_02_13_429885 641 4 for for IN 10_1101-2021_02_13_429885 641 5 this this DT 10_1101-2021_02_13_429885 641 6 sample sample NN 10_1101-2021_02_13_429885 641 7 are be VBP 10_1101-2021_02_13_429885 641 8 also also RB 10_1101-2021_02_13_429885 641 9 good good JJ 10_1101-2021_02_13_429885 641 10 . . . 