id sid tid token lemma pos
pk02c824t12 1 1 genome genome NOUN
pk02c824t12 1 2 structure structure NOUN
pk02c824t12 1 3 is be AUX
pk02c824t12 1 4 the the DET
pk02c824t12 1 5 order order NOUN
pk02c824t12 1 6 and and CCONJ
pk02c824t12 1 7 orientation orientation NOUN
pk02c824t12 1 8 of of ADP
pk02c824t12 1 9 pieces piece NOUN
pk02c824t12 1 10 of of ADP
pk02c824t12 1 11 dna dna NOUN
pk02c824t12 1 12 comprising comprise VERB
pk02c824t12 1 13 a a DET
pk02c824t12 1 14 genome genome NOUN
pk02c824t12 1 15 , , PUNCT
pk02c824t12 1 16 which which PRON
pk02c824t12 1 17 contains contain VERB
pk02c824t12 1 18 the the DET
pk02c824t12 1 19 information information NOUN
pk02c824t12 1 20 of of ADP
pk02c824t12 1 21 life life NOUN
pk02c824t12 1 22 . . PUNCT
pk02c824t12 2 1 with with ADP
pk02c824t12 2 2 advances advance NOUN
pk02c824t12 2 3 in in ADP
pk02c824t12 2 4 dna dna NOUN
pk02c824t12 2 5 sequencing sequence VERB
pk02c824t12 2 6 technology technology NOUN
pk02c824t12 2 7 and and CCONJ
pk02c824t12 2 8 now now ADV
pk02c824t12 2 9 massive massive ADJ
pk02c824t12 2 10 availability availability NOUN
pk02c824t12 2 11 of of ADP
pk02c824t12 2 12 sequence sequence NOUN
pk02c824t12 2 13 data datum NOUN
pk02c824t12 2 14 , , PUNCT
pk02c824t12 2 15 the the DET
pk02c824t12 2 16 study study NOUN
pk02c824t12 2 17 of of ADP
pk02c824t12 2 18 genome genome NOUN
pk02c824t12 2 19 structure structure NOUN
pk02c824t12 2 20 can can AUX
pk02c824t12 2 21 not not PART
pk02c824t12 2 22 be be AUX
pk02c824t12 2 23 easily easily ADV
pk02c824t12 2 24 carried carry VERB
pk02c824t12 2 25 out out ADP
pk02c824t12 2 26 without without ADP
pk02c824t12 2 27 efficient efficient ADJ
pk02c824t12 2 28 and and CCONJ
pk02c824t12 2 29 expressly expressly ADV
pk02c824t12 2 30 designed design VERB
pk02c824t12 2 31 algorithms algorithm NOUN
pk02c824t12 2 32 . . PUNCT
pk02c824t12 3 1 in in ADP
pk02c824t12 3 2 this this DET
pk02c824t12 3 3 dissertation dissertation NOUN
pk02c824t12 3 4 , , PUNCT
pk02c824t12 3 5 we we PRON
pk02c824t12 3 6 study study VERB
pk02c824t12 3 7 three three NUM
pk02c824t12 3 8 genome genome NOUN
pk02c824t12 3 9 structure structure NOUN
pk02c824t12 3 10 - - PUNCT
pk02c824t12 3 11 related relate VERB
pk02c824t12 3 12 problems problem NOUN
pk02c824t12 3 13 : : PUNCT
pk02c824t12 3 14 structural structural ADJ
pk02c824t12 3 15 error error NOUN
pk02c824t12 3 16 correction correction NOUN
pk02c824t12 3 17 of of ADP
pk02c824t12 3 18 draft draft NOUN
pk02c824t12 3 19 genome genome NOUN
pk02c824t12 3 20 assemblies assembly NOUN
pk02c824t12 3 21 , , PUNCT
pk02c824t12 3 22 inversion inversion NOUN
pk02c824t12 3 23 prediction prediction NOUN
pk02c824t12 3 24 , , PUNCT
pk02c824t12 3 25 and and CCONJ
pk02c824t12 3 26 predicting predict VERB
pk02c824t12 3 27 operons operon NOUN
pk02c824t12 3 28 . . PUNCT
pk02c824t12 4 1 our our PRON
pk02c824t12 4 2 work work NOUN
pk02c824t12 4 3 with with ADP
pk02c824t12 4 4 draft draft NOUN
pk02c824t12 4 5 genome genome NOUN
pk02c824t12 4 6 assemblies assembly NOUN
pk02c824t12 4 7 explores explore VERB
pk02c824t12 4 8 a a DET
pk02c824t12 4 9 novel novel ADJ
pk02c824t12 4 10 maximum maximum NOUN
pk02c824t12 4 11 alternating alternate VERB
pk02c824t12 4 12 path path NOUN
pk02c824t12 4 13 cover cover NOUN
pk02c824t12 4 14 ( ( PUNCT
pk02c824t12 4 15 mapc mapc NOUN
pk02c824t12 4 16 ) ) PUNCT
pk02c824t12 4 17 model model NOUN
pk02c824t12 4 18 to to PART
pk02c824t12 4 19 improve improve VERB
pk02c824t12 4 20 genome genome NOUN
pk02c824t12 4 21 correctness correctness NOUN
pk02c824t12 4 22 and and CCONJ
pk02c824t12 4 23 downstream downstream ADJ
pk02c824t12 4 24 analysis analysis NOUN
pk02c824t12 4 25 . . PUNCT
pk02c824t12 5 1 our our PRON
pk02c824t12 5 2 work work NOUN
pk02c824t12 5 3 on on ADP
pk02c824t12 5 4 inversion inversion NOUN
pk02c824t12 5 5 prediction prediction NOUN
pk02c824t12 5 6 aims aim VERB
pk02c824t12 5 7 to to PART
pk02c824t12 5 8 predict predict VERB
pk02c824t12 5 9 and and CCONJ
pk02c824t12 5 10 catalog catalog NOUN
pk02c824t12 5 11 inversions inversion NOUN
pk02c824t12 5 12 by by ADP
pk02c824t12 5 13 exploring explore VERB
pk02c824t12 5 14 the the DET
pk02c824t12 5 15 well well ADV
pk02c824t12 5 16 - - PUNCT
pk02c824t12 5 17 known know VERB
pk02c824t12 5 18 range range NOUN
pk02c824t12 5 19 maximum maximum ADJ
pk02c824t12 5 20 query query NOUN
pk02c824t12 5 21 model model NOUN
pk02c824t12 5 22 and and CCONJ
pk02c824t12 5 23 max max PROPN
pk02c824t12 5 24 - - PUNCT
pk02c824t12 5 25 cut cut VERB
pk02c824t12 5 26 model model NOUN
pk02c824t12 5 27 for for ADP
pk02c824t12 5 28 what what PRON
pk02c824t12 5 29 we we PRON
pk02c824t12 5 30 call call VERB
pk02c824t12 5 31 ` ` PUNCT
pk02c824t12 5 32 ` ` PUNCT
pk02c824t12 5 33 global global ADJ
pk02c824t12 5 34 '' '' PUNCT
pk02c824t12 5 35 inversions inversion NOUN
pk02c824t12 5 36 , , PUNCT
pk02c824t12 5 37 and and CCONJ
pk02c824t12 5 38 the the DET
pk02c824t12 5 39 novel novel ADJ
pk02c824t12 5 40 rectangle rectangle NOUN
pk02c824t12 5 41 clustering clustering NOUN
pk02c824t12 5 42 model model NOUN
pk02c824t12 5 43 and and CCONJ
pk02c824t12 5 44 representative representative ADJ
pk02c824t12 5 45 rectangle rectangle ADJ
pk02c824t12 5 46 prediction prediction NOUN
pk02c824t12 5 47 model model NOUN
pk02c824t12 5 48 for for ADP
pk02c824t12 5 49 more more ADV
pk02c824t12 5 50 localized localized ADJ
pk02c824t12 5 51 inversions inversion NOUN
pk02c824t12 5 52 . . PUNCT
pk02c824t12 6 1 for for ADP
pk02c824t12 6 2 operon operon NOUN
pk02c824t12 6 3 prediction prediction NOUN
pk02c824t12 6 4 , , PUNCT
pk02c824t12 6 5 we we PRON
pk02c824t12 6 6 again again ADV
pk02c824t12 6 7 apply apply VERB
pk02c824t12 6 8 the the DET
pk02c824t12 6 9 mapc mapc NOUN
pk02c824t12 6 10 model model NOUN
pk02c824t12 6 11 ( ( PUNCT
pk02c824t12 6 12 with with ADP
pk02c824t12 6 13 improved improved ADJ
pk02c824t12 6 14 algorithms algorithm NOUN
pk02c824t12 6 15 and and CCONJ
pk02c824t12 6 16 theoretical theoretical ADJ
pk02c824t12 6 17 analysis analysis NOUN
pk02c824t12 6 18 ) ) PUNCT
pk02c824t12 6 19 , , PUNCT
pk02c824t12 6 20 coupled couple VERB
pk02c824t12 6 21 with with ADP
pk02c824t12 6 22 a a DET
pk02c824t12 6 23 novel novel ADJ
pk02c824t12 6 24 intro intro PROPN
pk02c824t12 6 25 - - PUNCT
pk02c824t12 6 26 column column NOUN
pk02c824t12 6 27 exclusive exclusive ADJ
pk02c824t12 6 28 clustering clustering NOUN
pk02c824t12 6 29 model model NOUN
pk02c824t12 6 30 , , PUNCT
pk02c824t12 6 31 to to PART
pk02c824t12 6 32 predict predict VERB
pk02c824t12 6 33 and and CCONJ
pk02c824t12 6 34 catalog catalog NOUN
pk02c824t12 6 35 operons operon NOUN
pk02c824t12 6 36 in in ADP
pk02c824t12 6 37 closely closely ADV
pk02c824t12 6 38 related relate VERB
pk02c824t12 6 39 species specie NOUN
pk02c824t12 6 40 . . PUNCT
pk02c824t12 7 1 evaluated evaluate VERB
pk02c824t12 7 2 using use VERB
pk02c824t12 7 3 both both CCONJ
pk02c824t12 7 4 simulated simulated ADJ
pk02c824t12 7 5 and and CCONJ
pk02c824t12 7 6 real real ADJ
pk02c824t12 7 7 genome genome NOUN
pk02c824t12 7 8 data datum NOUN
pk02c824t12 7 9 , , PUNCT
pk02c824t12 7 10 our our PRON
pk02c824t12 7 11 algorithms algorithm NOUN
pk02c824t12 7 12 and and CCONJ
pk02c824t12 7 13 implementations implementation NOUN
pk02c824t12 7 14 have have AUX
pk02c824t12 7 15 shown show VERB
pk02c824t12 7 16 substantial substantial ADJ
pk02c824t12 7 17 promise promise NOUN
pk02c824t12 7 18 for for ADP
pk02c824t12 7 19 accurate accurate ADJ
pk02c824t12 7 20 computational computational ADJ
pk02c824t12 7 21 analysis analysis NOUN
pk02c824t12 7 22 of of ADP
pk02c824t12 7 23 genome genome NOUN
pk02c824t12 7 24 structure structure NOUN
pk02c824t12 7 25 in in ADP
pk02c824t12 7 26 significantly significantly ADV
pk02c824t12 7 27 shorter short ADJ
pk02c824t12 7 28 time time NOUN
pk02c824t12 7 29 . . PUNCT