id sid tid token lemma pos h415p843h2t 1 1 as as ADP h415p843h2t 1 2 the the DET h415p843h2t 1 3 cost cost NOUN h415p843h2t 1 4 of of ADP h415p843h2t 1 5 dna dna PROPN h415p843h2t 1 6 sequencing sequencing NOUN h415p843h2t 1 7 falls fall VERB h415p843h2t 1 8 , , PUNCT h415p843h2t 1 9 the the DET h415p843h2t 1 10 relative relative ADJ h415p843h2t 1 11 cost cost NOUN h415p843h2t 1 12 of of ADP h415p843h2t 1 13 finishing finish VERB h415p843h2t 1 14 steps step NOUN h415p843h2t 1 15 ( ( PUNCT h415p843h2t 1 16 e.g. e.g. ADV h415p843h2t 1 17 , , PUNCT h415p843h2t 1 18 error error NOUN h415p843h2t 1 19 correction correction NOUN h415p843h2t 1 20 and and CCONJ h415p843h2t 1 21 gap gap NOUN h415p843h2t 1 22 - - PUNCT h415p843h2t 1 23 closing closing NOUN h415p843h2t 1 24 ) ) PUNCT h415p843h2t 1 25 is be AUX h415p843h2t 1 26 increasing increase VERB h415p843h2t 1 27 . . PUNCT h415p843h2t 2 1 as as ADP h415p843h2t 2 2 a a DET h415p843h2t 2 3 result result NOUN h415p843h2t 2 4 , , PUNCT h415p843h2t 2 5 many many ADJ h415p843h2t 2 6 completed complete VERB h415p843h2t 2 7 genome genome NOUN h415p843h2t 2 8 projects project NOUN h415p843h2t 2 9 are be AUX h415p843h2t 2 10 only only ADV h415p843h2t 2 11 completed complete VERB h415p843h2t 2 12 to to PART h415p843h2t 2 13 draft draft VERB h415p843h2t 2 14 stages stage NOUN h415p843h2t 2 15 and and CCONJ h415p843h2t 2 16 may may AUX h415p843h2t 2 17 not not PART h415p843h2t 2 18 provide provide VERB h415p843h2t 2 19 full full ADJ h415p843h2t 2 20 information information NOUN h415p843h2t 2 21 about about ADP h415p843h2t 2 22 the the DET h415p843h2t 2 23 location location NOUN h415p843h2t 2 24 of of ADP h415p843h2t 2 25 sequences sequence NOUN h415p843h2t 2 26 on on ADP h415p843h2t 2 27 the the DET h415p843h2t 2 28 chromosome chromosome NOUN h415p843h2t 2 29 . . PUNCT h415p843h2t 3 1 further far ADV h415p843h2t 3 2 , , PUNCT h415p843h2t 3 3 they they PRON h415p843h2t 3 4 may may AUX h415p843h2t 3 5 contain contain VERB h415p843h2t 3 6 gaps gap NOUN h415p843h2t 3 7 and and CCONJ h415p843h2t 3 8 assembly assembly NOUN h415p843h2t 3 9 errors error NOUN h415p843h2t 3 10 . . PUNCT h415p843h2t 4 1 whether whether SCONJ h415p843h2t 4 2 draft draft VERB h415p843h2t 4 3 or or CCONJ h415p843h2t 4 4 finished finish VERB h415p843h2t 4 5 , , PUNCT h415p843h2t 4 6 the the DET h415p843h2t 4 7 output output NOUN h415p843h2t 4 8 of of ADP h415p843h2t 4 9 a a DET h415p843h2t 4 10 genome genome NOUN h415p843h2t 4 11 sequence sequence NOUN h415p843h2t 4 12 project project NOUN h415p843h2t 4 13 serves serve VERB h415p843h2t 4 14 as as ADP h415p843h2t 4 15 the the DET h415p843h2t 4 16 input input NOUN h415p843h2t 4 17 to to ADP h415p843h2t 4 18 a a DET h415p843h2t 4 19 host host NOUN h415p843h2t 4 20 of of ADP h415p843h2t 4 21 analysis analysis NOUN h415p843h2t 4 22 tools tool NOUN h415p843h2t 4 23 such such ADJ h415p843h2t 4 24 as as ADP h415p843h2t 4 25 gene gene NOUN h415p843h2t 4 26 finding finding NOUN h415p843h2t 4 27 or or CCONJ h415p843h2t 4 28 variation variation NOUN h415p843h2t 4 29 analysis analysis NOUN h415p843h2t 4 30 . . PUNCT h415p843h2t 5 1 many many ADJ h415p843h2t 5 2 of of ADP h415p843h2t 5 3 these these DET h415p843h2t 5 4 tools tool NOUN h415p843h2t 5 5 have have AUX h415p843h2t 5 6 been be AUX h415p843h2t 5 7 designed design VERB h415p843h2t 5 8 for for ADP h415p843h2t 5 9 and and CCONJ h415p843h2t 5 10 tested test VERB h415p843h2t 5 11 on on ADP h415p843h2t 5 12 high high ADJ h415p843h2t 5 13 - - PUNCT h415p843h2t 5 14 quality quality NOUN h415p843h2t 5 15 , , PUNCT h415p843h2t 5 16 finished finished ADJ h415p843h2t 5 17 genomes genome NOUN h415p843h2t 5 18 such such ADJ h415p843h2t 5 19 as as ADP h415p843h2t 5 20 human human NOUN h415p843h2t 5 21 or or CCONJ h415p843h2t 5 22 the the DET h415p843h2t 5 23 fruit fruit NOUN h415p843h2t 5 24 fly fly NOUN h415p843h2t 5 25 drosophila drosophila NOUN h415p843h2t 5 26 melanogaster melanogaster NOUN h415p843h2t 5 27 . . PUNCT h415p843h2t 6 1 in in ADP h415p843h2t 6 2 this this DET h415p843h2t 6 3 thesis thesis NOUN h415p843h2t 6 4 we we PRON h415p843h2t 6 5 discuss discuss VERB h415p843h2t 6 6 specific specific ADJ h415p843h2t 6 7 challenges challenge NOUN h415p843h2t 6 8 in in ADP h415p843h2t 6 9 working work VERB h415p843h2t 6 10 with with ADP h415p843h2t 6 11 draft draft NOUN h415p843h2t 6 12 genomes genome NOUN h415p843h2t 6 13 and and CCONJ h415p843h2t 6 14 show show VERB h415p843h2t 6 15 how how SCONJ h415p843h2t 6 16 methods method NOUN h415p843h2t 6 17 can can AUX h415p843h2t 6 18 be be AUX h415p843h2t 6 19 adapted adapt VERB h415p843h2t 6 20 to to PART h415p843h2t 6 21 be be AUX h415p843h2t 6 22 more more ADV h415p843h2t 6 23 effective effective ADJ h415p843h2t 6 24 in in ADP h415p843h2t 6 25 draft draft NOUN h415p843h2t 6 26 genomes genome NOUN h415p843h2t 6 27 . . PUNCT h415p843h2t 7 1 first first ADV h415p843h2t 7 2 , , PUNCT h415p843h2t 7 3 we we PRON h415p843h2t 7 4 examine examine VERB h415p843h2t 7 5 computational computational ADJ h415p843h2t 7 6 methods method NOUN h415p843h2t 7 7 for for ADP h415p843h2t 7 8 finding find VERB h415p843h2t 7 9 errors error NOUN h415p843h2t 7 10 in in ADP h415p843h2t 7 11 draft draft NOUN h415p843h2t 7 12 assemblies assembly NOUN h415p843h2t 7 13 . . PUNCT h415p843h2t 8 1 next next ADV h415p843h2t 8 2 , , PUNCT h415p843h2t 8 3 we we PRON h415p843h2t 8 4 modify modify VERB h415p843h2t 8 5 a a DET h415p843h2t 8 6 technique technique NOUN h415p843h2t 8 7 for for ADP h415p843h2t 8 8 finding find VERB h415p843h2t 8 9 dna dna NOUN h415p843h2t 8 10 inversions inversion NOUN h415p843h2t 8 11 between between ADP h415p843h2t 8 12 two two NUM h415p843h2t 8 13 genomes genome NOUN h415p843h2t 8 14 to to PART h415p843h2t 8 15 account account VERB h415p843h2t 8 16 for for ADP h415p843h2t 8 17 gaps gap NOUN h415p843h2t 8 18 in in ADP h415p843h2t 8 19 the the DET h415p843h2t 8 20 genomes genome NOUN h415p843h2t 8 21 . . PUNCT h415p843h2t 9 1 finally finally ADV h415p843h2t 9 2 , , PUNCT h415p843h2t 9 3 we we PRON h415p843h2t 9 4 develop develop VERB h415p843h2t 9 5 a a DET h415p843h2t 9 6 pipeline pipeline NOUN h415p843h2t 9 7 to to PART h415p843h2t 9 8 construct construct VERB h415p843h2t 9 9 chromosomes chromosome NOUN h415p843h2t 9 10 out out ADP h415p843h2t 9 11 of of ADP h415p843h2t 9 12 draft draft NOUN h415p843h2t 9 13 scaffolds scaffold NOUN h415p843h2t 9 14 using use VERB h415p843h2t 9 15 a a DET h415p843h2t 9 16 closely closely ADV h415p843h2t 9 17 related relate VERB h415p843h2t 9 18 reference reference NOUN h415p843h2t 9 19 genome genome NOUN h415p843h2t 9 20 . . PUNCT h415p843h2t 10 1 we we PRON h415p843h2t 10 2 use use VERB h415p843h2t 10 3 examples example NOUN h415p843h2t 10 4 from from ADP h415p843h2t 10 5 three three NUM h415p843h2t 10 6 different different ADJ h415p843h2t 10 7 species specie NOUN h415p843h2t 10 8 of of ADP h415p843h2t 10 9 importance importance NOUN h415p843h2t 10 10 to to ADP h415p843h2t 10 11 global global ADJ h415p843h2t 10 12 health health NOUN h415p843h2t 10 13 : : PUNCT h415p843h2t 10 14 the the DET h415p843h2t 10 15 body body NOUN h415p843h2t 10 16 louse louse NOUN h415p843h2t 10 17 ( ( PUNCT h415p843h2t 10 18 pediculus pediculus NOUN h415p843h2t 10 19 humanus humanu NOUN h415p843h2t 10 20 ) ) PUNCT h415p843h2t 10 21 , , PUNCT h415p843h2t 10 22 a a DET h415p843h2t 10 23 malaria malaria NOUN h415p843h2t 10 24 mosquito mosquito PROPN h415p843h2t 10 25 ( ( PUNCT h415p843h2t 10 26 anopheles anophele NOUN h415p843h2t 10 27 gambiae gambiae NOUN h415p843h2t 10 28 ) ) PUNCT h415p843h2t 10 29 , , PUNCT h415p843h2t 10 30 and and CCONJ h415p843h2t 10 31 the the DET h415p843h2t 10 32 human human ADJ h415p843h2t 10 33 malaria malaria NOUN h415p843h2t 10 34 parasite parasite NOUN h415p843h2t 10 35 ( ( PUNCT h415p843h2t 10 36 plasmodium plasmodium NOUN h415p843h2t 10 37 falciparum falciparum NOUN h415p843h2t 10 38 ) ) PUNCT h415p843h2t 10 39 . . PUNCT