id sid tid token lemma pos cf95j96248h 1 1 a a DET cf95j96248h 1 2 bac bac PROPN cf95j96248h 1 3 clone clone NOUN cf95j96248h 1 4 already already ADV cf95j96248h 1 5 known know VERB cf95j96248h 1 6 to to PART cf95j96248h 1 7 map map VERB cf95j96248h 1 8 to to ADP cf95j96248h 1 9 the the DET cf95j96248h 1 10 vicinity vicinity NOUN cf95j96248h 1 11 of of ADP cf95j96248h 1 12 the the DET cf95j96248h 1 13 proximal proximal ADJ cf95j96248h 1 14 breakpoints breakpoint NOUN cf95j96248h 1 15 the the DET cf95j96248h 1 16 2rj+ 2rj+ NUM cf95j96248h 1 17 ( ( PUNCT cf95j96248h 1 18 standard standard NOUN cf95j96248h 1 19 ) ) PUNCT cf95j96248h 1 20 was be AUX cf95j96248h 1 21 used use VERB cf95j96248h 1 22 as as ADP cf95j96248h 1 23 a a DET cf95j96248h 1 24 starting starting NOUN cf95j96248h 1 25 material material NOUN cf95j96248h 1 26 to to PART cf95j96248h 1 27 identify identify VERB cf95j96248h 1 28 the the DET cf95j96248h 1 29 2rj 2rj NOUN cf95j96248h 1 30 ( ( PUNCT cf95j96248h 1 31 inverted invert VERB cf95j96248h 1 32 ) ) PUNCT cf95j96248h 1 33 breakpoints breakpoint NOUN cf95j96248h 1 34 in in ADP cf95j96248h 1 35 bamako bamako PROPN cf95j96248h 1 36 chromosomal chromosomal NOUN cf95j96248h 1 37 form form NOUN cf95j96248h 1 38 . . PUNCT cf95j96248h 2 1 the the DET cf95j96248h 2 2 2rj 2rj ADJ cf95j96248h 2 3 distal distal ADJ cf95j96248h 2 4 and and CCONJ cf95j96248h 2 5 proximal proximal ADJ cf95j96248h 2 6 breakpoints breakpoint NOUN cf95j96248h 2 7 have have AUX cf95j96248h 2 8 been be AUX cf95j96248h 2 9 identified identify VERB cf95j96248h 2 10 , , PUNCT cf95j96248h 2 11 cloned clone VERB cf95j96248h 2 12 and and CCONJ cf95j96248h 2 13 sequenced sequence VERB cf95j96248h 2 14 . . PUNCT cf95j96248h 3 1 the the DET cf95j96248h 3 2 structure structure NOUN cf95j96248h 3 3 of of ADP cf95j96248h 3 4 the the DET cf95j96248h 3 5 breakpoints breakpoint NOUN cf95j96248h 3 6 showed show VERB cf95j96248h 3 7 that that SCONJ cf95j96248h 3 8 it it PRON cf95j96248h 3 9 is be AUX cf95j96248h 3 10 more more ADJ cf95j96248h 3 11 that that SCONJ cf95j96248h 3 12 a a DET cf95j96248h 3 13 simple simple ADJ cf95j96248h 3 14 cut cut NOUN cf95j96248h 3 15 and and CCONJ cf95j96248h 3 16 flip flip VERB cf95j96248h 3 17 . . PUNCT cf95j96248h 4 1 a a DET cf95j96248h 4 2 14.6 14.6 NUM cf95j96248h 4 3 kb kb PROPN cf95j96248h 4 4 insertion insertion NOUN cf95j96248h 4 5 at at ADP cf95j96248h 4 6 each each DET cf95j96248h 4 7 breakpoint breakpoint NOUN cf95j96248h 4 8 but but CCONJ cf95j96248h 4 9 opposite opposite ADJ cf95j96248h 4 10 orientation orientation NOUN cf95j96248h 4 11 has have AUX cf95j96248h 4 12 been be AUX cf95j96248h 4 13 identified identify VERB cf95j96248h 4 14 . . PUNCT cf95j96248h 5 1 this this DET cf95j96248h 5 2 fragment fragment NOUN cf95j96248h 5 3 is be AUX cf95j96248h 5 4 made make VERB cf95j96248h 5 5 of of ADP cf95j96248h 5 6 two two NUM cf95j96248h 5 7 almost almost ADV cf95j96248h 5 8 perfect perfect ADJ cf95j96248h 5 9 5.3 5.3 NUM cf95j96248h 5 10 kb kb PROPN cf95j96248h 5 11 inverted invert VERB cf95j96248h 5 12 repeats repeat NOUN cf95j96248h 5 13 separated separate VERB cf95j96248h 5 14 by by ADP cf95j96248h 5 15 a a DET cf95j96248h 5 16 4 4 NUM cf95j96248h 5 17 kb kb PROPN cf95j96248h 5 18 section section NOUN cf95j96248h 5 19 and and CCONJ cf95j96248h 5 20 is be AUX cf95j96248h 5 21 structurally structurally ADV cf95j96248h 5 22 very very ADV cf95j96248h 5 23 similar similar ADJ cf95j96248h 5 24 to to ADP cf95j96248h 5 25 type type NOUN cf95j96248h 5 26 3 3 NUM cf95j96248h 5 27 fold fold VERB cf95j96248h 5 28 back back ADV cf95j96248h 5 29 transposable transposable ADJ cf95j96248h 5 30 elements element NOUN cf95j96248h 5 31 . . PUNCT cf95j96248h 6 1 sequence sequence NOUN cf95j96248h 6 2 analysis analysis NOUN cf95j96248h 6 3 of of ADP cf95j96248h 6 4 the the DET cf95j96248h 6 5 flanking flank VERB cf95j96248h 6 6 regions region NOUN cf95j96248h 6 7 of of ADP cf95j96248h 6 8 the the DET cf95j96248h 6 9 breakpoints breakpoint NOUN cf95j96248h 6 10 revealed reveal VERB cf95j96248h 6 11 the the DET cf95j96248h 6 12 presence presence NOUN cf95j96248h 6 13 of of ADP cf95j96248h 6 14 four four NUM cf95j96248h 6 15 genes gene NOUN cf95j96248h 6 16 . . PUNCT cf95j96248h 7 1 however however ADV cf95j96248h 7 2 we we PRON cf95j96248h 7 3 found find VERB cf95j96248h 7 4 no no DET cf95j96248h 7 5 evidence evidence NOUN cf95j96248h 7 6 of of ADP cf95j96248h 7 7 interrupted interrupted ADJ cf95j96248h 7 8 transcripts transcript NOUN cf95j96248h 7 9 by by ADP cf95j96248h 7 10 the the DET cf95j96248h 7 11 inversion inversion NOUN cf95j96248h 7 12 breaks break NOUN cf95j96248h 7 13 . . PUNCT cf95j96248h 8 1 a a DET cf95j96248h 8 2 simple simple ADJ cf95j96248h 8 3 pcr pcr NOUN cf95j96248h 8 4 assay assay NOUN cf95j96248h 8 5 was be AUX cf95j96248h 8 6 designed design VERB cf95j96248h 8 7 to to PART cf95j96248h 8 8 diagnose diagnose VERB cf95j96248h 8 9 the the DET cf95j96248h 8 10 2rj 2rj ADJ cf95j96248h 8 11 inversion inversion NOUN cf95j96248h 8 12 . . PUNCT