id sid tid token lemma pos 8951 1 1 201 201 CD 8951 1 2 Application application NN 8951 1 3 of of IN 8951 1 4 the the DT 8951 1 5 Variety Variety NNP 8951 1 6 - - HYPH 8951 1 7 Generator Generator NNP 8951 1 8 Approach Approach NNP 8951 1 9 to to IN 8951 1 10 Searches search NNS 8951 1 11 of of IN 8951 1 12 Personal Personal NNP 8951 1 13 Names Names NNPS 8951 1 14 in in IN 8951 1 15 Bibliographic Bibliographic NNP 8951 1 16 Data Data NNP 8951 1 17 Bases Bases NNPS 8951 1 18 - - HYPH 8951 1 19 Part Part NNP 8951 1 20 2 2 CD 8951 1 21 . . . 8951 2 1 Optimization optimization NN 8951 2 2 of of IN 8951 2 3 Key Key NNP 8951 2 4 - - HYPH 8951 2 5 Sets Sets NNPS 8951 2 6 , , , 8951 2 7 and and CC 8951 2 8 Evaluation evaluation NN 8951 2 9 of of IN 8951 2 10 Their -PRON- PRP$ 8951 2 11 Retrieval Retrieval NNP 8951 2 12 Efficiency Efficiency NNP 8951 2 13 Dirk Dirk NNP 8951 2 14 W. W. NNP 8951 2 15 FOKKER FOKKER NNP 8951 2 16 and and CC 8951 2 17 Michael Michael NNP 8951 2 18 F. F. NNP 8951 2 19 LYNCH LYNCH NNP 8951 2 20 : : : 8951 2 21 Postgraduate Postgraduate NNP 8951 2 22 School School NNP 8951 2 23 of of IN 8951 2 24 Librarianship Librarianship NNP 8951 2 25 and and CC 8951 2 26 Information Information NNP 8951 2 27 Science Science NNP 8951 2 28 , , , 8951 2 29 University University NNP 8951 2 30 of of IN 8951 2 31 Sheffield Sheffield NNP 8951 2 32 , , , 8951 2 33 England England NNP 8951 2 34 . . . 8951 3 1 Keys key NNS 8951 3 2 consisting consist VBG 8951 3 3 of of IN 8951 3 4 variable variable JJ 8951 3 5 - - HYPH 8951 3 6 length length NN 8951 3 7 chamcter chamcter NN 8951 3 8 strings string NNS 8951 3 9 from from IN 8951 3 10 the the DT 8951 3 11 front front NN 8951 3 12 and and CC 8951 3 13 rear rear NN 8951 3 14 of of IN 8951 3 15 surnames surname NNS 8951 3 16 , , , 8951 3 17 derived derive VBN 8951 3 18 by by IN 8951 3 19 analysis analysis NN 8951 3 20 of of IN 8951 3 21 author author NN 8951 3 22 names name NNS 8951 3 23 in in IN 8951 3 24 a a DT 8951 3 25 particular particular JJ 8951 3 26 data data NN 8951 3 27 base base NN 8951 3 28 , , , 8951 3 29 am be VBP 8951 3 30 used use VBN 8951 3 31 to to TO 8951 3 32 provide provide VB 8951 3 33 approximate approximate JJ 8951 3 34 representations representation NNS 8951 3 35 of of IN 8951 3 36 author author NN 8951 3 37 names name NNS 8951 3 38 . . . 8951 4 1 When when WRB 8951 4 2 combined combine VBN 8951 4 3 in in IN 8951 4 4 appropriate appropriate JJ 8951 4 5 mtios mtio NNS 8951 4 6 , , , 8951 4 7 and and CC 8951 4 8 used use VBN 8951 4 9 together together RB 8951 4 10 with with IN 8951 4 11 keys key NNS 8951 4 12 for for IN 8951 4 13 each each DT 8951 4 14 of of IN 8951 4 15 the the DT 8951 4 16 first first JJ 8951 4 17 two two CD 8951 4 18 initials initial NNS 8951 4 19 of of IN 8951 4 20 personal personal JJ 8951 4 21 names name NNS 8951 4 22 , , , 8951 4 23 they -PRON- PRP 8951 4 24 provide provide VBP 8951 4 25 a a DT 8951 4 26 high high JJ 8951 4 27 degme degme NNS 8951 4 28 of of IN 8951 4 29 discrimination discrimination NN 8951 4 30 in in IN 8951 4 31 search search NN 8951 4 32 . . . 8951 5 1 Methods method NNS 8951 5 2 for for IN 8951 5 3 optimization optimization NN 8951 5 4 of of IN 8951 5 5 key key NN 8951 5 6 - - HYPH 8951 5 7 sets set NNS 8951 5 8 are be VBP 8951 5 9 desc1·ibed desc1·ibed NNP 8951 5 10 , , , 8951 5 11 and and CC 8951 5 12 the the DT 8951 5 13 perform- perform- NN 8951 5 14 ance ance NN 8951 5 15 of of IN 8951 5 16 key key JJ 8951 5 17 - - HYPH 8951 5 18 sets set NNS 8951 5 19 varying vary VBG 8951 5 20 in in IN 8951 5 21 size size NN 8951 5 22 between between IN 8951 5 23 150 150 CD 8951 5 24 and and CC 8951 5 25 300 300 CD 8951 5 26 is be VBZ 8951 5 27 determined determine VBN 8951 5 28 at at IN 8951 5 29 file file NN 8951 5 30 sizes size NNS 8951 5 31 of of IN 8951 5 32 up up IN 8951 5 33 to to TO 8951 5 34 50,000 50,000 CD 8951 5 35 name name NN 8951 5 36 entries entry NNS 8951 5 37 . . . 8951 6 1 The the DT 8951 6 2 effects effect NNS 8951 6 3 of of IN 8951 6 4 varying vary VBG 8951 6 5 the the DT 8951 6 6 proportions proportion NNS 8951 6 7 of of IN 8951 6 8 the the DT 8951 6 9 queries query NNS 8951 6 10 present present JJ 8951 6 11 in in IN 8951 6 12 the the DT 8951 6 13 file file NN 8951 6 14 are be VBP 8951 6 15 also also RB 8951 6 16 examined examine VBN 8951 6 17 . . . 8951 7 1 The the DT 8951 7 2 results result NNS 8951 7 3 obtained obtain VBN 8951 7 4 with with IN 8951 7 5 fixed fix VBN 8951 7 6 - - HYPH 8951 7 7 length length NN 8951 7 8 keys key NNS 8951 7 9 are be VBP 8951 7 10 compared compare VBN 8951 7 11 with with IN 8951 7 12 those those DT 8951 7 13 f01 f01 NN 8951 7 14 ' ' `` 8951 7 15 variable variable JJ 8951 7 16 - - HYPH 8951 7 17 length length NN 8951 7 18 keys key NNS 8951 7 19 , , , 8951 7 20 showing show VBG 8951 7 21 the the DT 8951 7 22 latter latter NN 8951 7 23 to to TO 8951 7 24 be be VB 8951 7 25 greatly greatly RB 8951 7 26 superior superior JJ 8951 7 27 . . . 8951 8 1 Implications implication NNS 8951 8 2 of of IN 8951 8 3 the the DT 8951 8 4 work work NN 8951 8 5 for for IN 8951 8 6 a a DT 8951 8 7 variety variety NN 8951 8 8 of of IN 8951 8 9 types type NNS 8951 8 10 of of IN 8951 8 11 information information NN 8951 8 12 systems system NNS 8951 8 13 a1'e a1'e NNP 8951 8 14 discussed discuss VBN 8951 8 15 . . . 8951 9 1 INTRODUCTION INTRODUCTION NNP 8951 9 2 In in IN 8951 9 3 Part Part NNP 8951 9 4 I -PRON- PRP 8951 9 5 of of IN 8951 9 6 this this DT 8951 9 7 series series NN 8951 9 8 the the DT 8951 9 9 development development NN 8951 9 10 of of IN 8951 9 11 variety variety NN 8951 9 12 generators generator NNS 8951 9 13 , , , 8951 9 14 or or CC 8951 9 15 sets set NNS 8951 9 16 of of IN 8951 9 17 variable variable JJ 8951 9 18 - - HYPH 8951 9 19 length length NN 8951 9 20 keys key NNS 8951 9 21 with with IN 8951 9 22 high high JJ 8951 9 23 relative relative JJ 8951 9 24 entropies entropy NNS 8951 9 25 of of IN 8951 9 26 occurrence occurrence NN 8951 9 27 , , , 8951 9 28 from from IN 8951 9 29 the the DT 8951 9 30 initial initial JJ 8951 9 31 and and CC 8951 9 32 terminal terminal JJ 8951 9 33 character character NN 8951 9 34 strings string NNS 8951 9 35 of of IN 8951 9 36 authors author NNS 8951 9 37 ' ' POS 8951 9 38 surnames surname NNS 8951 9 39 was be VBD 8951 9 40 described.1 described.1 IN 8951 9 41 Their -PRON- PRP$ 8951 9 42 purpose purpose NN 8951 9 43 , , , 8951 9 44 used use VBD 8951 9 45 singly singly RB 8951 9 46 or or CC 8951 9 47 in in IN 8951 9 48 combination combination NN 8951 9 49 , , , 8951 9 50 is be VBZ 8951 9 51 to to TO 8951 9 52 provide provide VB 8951 9 53 a a DT 8951 9 54 high high JJ 8951 9 55 and and CC 8951 9 56 con- con- NN 8951 9 57 stant stant JJ 8951 9 58 degree degree NN 8951 9 59 of of IN 8951 9 60 discrimination discrimination NN 8951 9 61 among among IN 8951 9 62 personal personal JJ 8951 9 63 names name NNS 8951 9 64 so so IN 8951 9 65 as as IN 8951 9 66 to to TO 8951 9 67 facilitate facilitate VB 8951 9 68 searches search NNS 8951 9 69 for for IN 8951 9 70 them -PRON- PRP 8951 9 71 . . . 8951 10 1 In in IN 8951 10 2 this this DT 8951 10 3 paper paper NN 8951 10 4 the the DT 8951 10 5 selection selection NN 8951 10 6 of of IN 8951 10 7 optimal optimal JJ 8951 10 8 combinations combination NNS 8951 10 9 of of IN 8951 10 10 the the DT 8951 10 11 keys key NNS 8951 10 12 and and CC 8951 10 13 evaluation evaluation NN 8951 10 14 of of IN 8951 10 15 their -PRON- PRP$ 8951 10 16 efficiency efficiency NN 8951 10 17 in in IN 8951 10 18 search search NN 8951 10 19 are be VBP 8951 10 20 described describe VBN 8951 10 21 . . . 8951 11 1 The the DT 8951 11 2 performance performance NN 8951 11 3 of of IN 8951 11 4 combined combine VBN 8951 11 5 key key NN 8951 11 6 - - HYPH 8951 11 7 sets set NNS 8951 11 8 of of IN 8951 11 9 various various JJ 8951 11 10 compositions composition NNS 8951 11 11 is be VBZ 8951 11 12 determined determine VBN 8951 11 13 at at IN 8951 11 14 a a DT 8951 11 15 range range NN 8951 11 16 of of IN 8951 11 17 file file NN 8951 11 18 sizes size NNS 8951 11 19 and and CC 8951 11 20 compared compare VBN 8951 11 21 with with IN 8951 11 22 fixed fix VBN 8951 11 23 - - HYPH 8951 11 24 length length NN 8951 11 25 keys key NNS 8951 11 26 . . . 8951 12 1 In in IN 8951 12 2 addition addition NN 8951 12 3 , , , 8951 12 4 202 202 CD 8951 12 5 1 1 CD 8951 12 6 ournal ournal NN 8951 12 7 of of IN 8951 12 8 Lib1'm'y Lib1'm'y NNP 8951 12 9 Automation Automation NNP 8951 12 10 Vol Vol NNP 8951 12 11 . . . 8951 13 1 7 7 CD 8951 13 2 I i NN 8951 13 3 3 3 CD 8951 13 4 September September NNP 8951 13 5 197 197 CD 8951 13 6 4 4 CD 8951 13 7 the the DT 8951 13 8 extent extent NN 8951 13 9 of of IN 8951 13 10 statistical statistical JJ 8951 13 11 associations association NNS 8951 13 12 among among IN 8951 13 13 keys key NNS 8951 13 14 from from IN 8951 13 15 different different JJ 8951 13 16 positions position NNS 8951 13 17 in in IN 8951 13 18 the the DT 8951 13 19 names name NNS 8951 13 20 is be VBZ 8951 13 21 determined determine VBN 8951 13 22 . . . 8951 14 1 BALANCING balancing NN 8951 14 2 OF of IN 8951 14 3 KEY KEY NNP 8951 14 4 - - HYPH 8951 14 5 SETS SETS NNP 8951 14 6 The the DT 8951 14 7 relative relative JJ 8951 14 8 entropies entropy NNS 8951 14 9 of of IN 8951 14 10 distribution distribution NN 8951 14 11 of of IN 8951 14 12 the the DT 8951 14 13 first first JJ 8951 14 14 and and CC 8951 14 15 last last JJ 8951 14 16 letters letter NNS 8951 14 17 of of IN 8951 14 18 the the DT 8951 14 19 surnames surname NNS 8951 14 20 of of IN 8951 14 21 authors author NNS 8951 14 22 in in IN 8951 14 23 the the DT 8951 14 24 file file NN 8951 14 25 of of IN 8951 14 26 100,000 100,000 CD 8951 14 27 entries entry NNS 8951 14 28 from from IN 8951 14 29 the the DT 8951 14 30 INSPEC INSPEC NNP 8951 14 31 data data NN 8951 14 32 base base NN 8951 14 33 differ differ VBP 8951 14 34 significantly significantly RB 8951 14 35 , , , 8951 14 36 the the DT 8951 14 37 former former JJ 8951 14 38 being be VBG 8951 14 39 0.92 0.92 CD 8951 14 40 and and CC 8951 14 41 the the DT 8951 14 42 latter latter JJ 8951 14 43 0.86 0.86 CD 8951 14 44 . . . 8951 15 1 As as IN 8951 15 2 a a DT 8951 15 3 re- re- JJ 8951 15 4 sult sult NN 8951 15 5 , , , 8951 15 6 a a DT 8951 15 7 larger large JJR 8951 15 8 key key NN 8951 15 9 - - HYPH 8951 15 10 set set NN 8951 15 11 has have VBZ 8951 15 12 to to TO 8951 15 13 be be VB 8951 15 14 produced produce VBN 8951 15 15 from from IN 8951 15 16 the the DT 8951 15 17 back back NN 8951 15 18 of of IN 8951 15 19 the the DT 8951 15 20 surnames surname NNS 8951 15 21 to to TO 8951 15 22 reach reach VB 8951 15 23 the the DT 8951 15 24 same same JJ 8951 15 25 value value NN 8951 15 26 of of IN 8951 15 27 the the DT 8951 15 28 relative relative JJ 8951 15 29 entropy entropy RB 8951 15 30 as as IN 8951 15 31 that that DT 8951 15 32 of of IN 8951 15 33 a a DT 8951 15 34 key key NN 8951 15 35 - - HYPH 8951 15 36 set set NN 8951 15 37 of of IN 8951 15 38 given give VBN 8951 15 39 size size NN 8951 15 40 from from IN 8951 15 41 the the DT 8951 15 42 front front NN 8951 15 43 of of IN 8951 15 44 the the DT 8951 15 45 surname surname NN 8951 15 46 . . . 8951 16 1 For for IN 8951 16 2 instance instance NN 8951 16 3 , , , 8951 16 4 the the DT 8951 16 5 value value NN 8951 16 6 of of IN 8951 16 7 0.954 0.954 CD 8951 16 8 is be VBZ 8951 16 9 reached reach VBN 8951 16 10 by by IN 8951 16 11 a a DT 8951 16 12 key key NN 8951 16 13 - - HYPH 8951 16 14 set set NN 8951 16 15 comprising comprise VBG 8951 16 16 41 41 CD 8951 16 17 keys key NNS 8951 16 18 from from IN 8951 16 19 the the DT 8951 16 20 front front NN 8951 16 21 of of IN 8951 16 22 the the DT 8951 16 23 name name NN 8951 16 24 , , , 8951 16 25 but but CC 8951 16 26 a a DT 8951 16 27 set set NN 8951 16 28 of of IN 8951 16 29 101 101 CD 8951 16 30 keys key NNS 8951 16 31 from from IN 8951 16 32 the the DT 8951 16 33 back back NN 8951 16 34 is be VBZ 8951 16 35 needed need VBN 8951 16 36 to to TO 8951 16 37 attain attain VB 8951 16 38 this this DT 8951 16 39 value value NN 8951 16 40 . . . 8951 17 1 It -PRON- PRP 8951 17 2 seemed seem VBD 8951 17 3 reasonable reasonable JJ 8951 17 4 to to TO 8951 17 5 assume assume VB 8951 17 6 that that IN 8951 17 7 keys key NNS 8951 17 8 from from IN 8951 17 9 the the DT 8951 17 10 front front JJ 8951 17 11 and and CC 8951 17 12 rear rear JJ 8951 17 13 should should MD 8951 17 14 be be VB 8951 17 15 com- com- NN 8951 17 16 bined bin VBN 8951 17 17 in in IN 8951 17 18 different different JJ 8951 17 19 proportions proportion NNS 8951 17 20 in in IN 8951 17 21 order order NN 8951 17 22 to to TO 8951 17 23 maximize maximize VB 8951 17 24 the the DT 8951 17 25 relative relative JJ 8951 17 26 entropy entropy RB 8951 17 27 of of IN 8951 17 28 the the DT 8951 17 29 combined combined JJ 8951 17 30 system system NN 8951 17 31 , , , 8951 17 32 and and CC 8951 17 33 that that IN 8951 17 34 their -PRON- PRP$ 8951 17 35 proportions proportion NNS 8951 17 36 should should MD 8951 17 37 reflect reflect VB 8951 17 38 the the DT 8951 17 39 redun- redun- JJ 8951 17 40 dancies dancie NNS 8951 17 41 of of IN 8951 17 42 each each DT 8951 17 43 distribution distribution NN 8951 17 44 ( ( -LRB- 8951 17 45 redundancy redundancy NN 8951 17 46 = = SYM 8951 17 47 1 1 CD 8951 17 48 - - HYPH 8951 17 49 Hr hr NN 8951 17 50 ) ) -RRB- 8951 17 51 . . . 8951 18 1 In in IN 8951 18 2 order order NN 8951 18 3 to to TO 8951 18 4 test test VB 8951 18 5 this this DT 8951 18 6 , , , 8951 18 7 a a DT 8951 18 8 series series NN 8951 18 9 of of IN 8951 18 10 combined combine VBN 8951 18 11 key key NN 8951 18 12 - - HYPH 8951 18 13 sets set NNS 8951 18 14 of of IN 8951 18 15 different different JJ 8951 18 16 total total JJ 8951 18 17 sizes size NNS 8951 18 18 was be VBD 8951 18 19 produced produce VBN 8951 18 20 , , , 8951 18 21 in in IN 8951 18 22 which which WDT 8951 18 23 the the DT 8951 18 24 proportions proportion NNS 8951 18 25 of of IN 8951 18 26 keys key NNS 8951 18 27 were be VBD 8951 18 28 varied varied JJ 8951 18 29 around around IN 8951 18 30 the the DT 8951 18 31 ratio ratio NN 8951 18 32 of of IN 8951 18 33 the the DT 8951 18 34 redun- redun- JJ 8951 18 35 dancies dancie NNS 8951 18 36 of of IN 8951 18 37 the the DT 8951 18 38 first first JJ 8951 18 39 and and CC 8951 18 40 last last JJ 8951 18 41 character character NN 8951 18 42 positions position NNS 8951 18 43 , , , 8951 18 44 i.e. i.e. FW 8951 18 45 , , , 8951 18 46 ( ( -LRB- 8951 18 47 1 1 CD 8951 18 48 - - SYM 8951 18 49 0.92 0.92 CD 8951 18 50 ) ) -RRB- 8951 18 51 : : : 8951 18 52 ( ( -LRB- 8951 18 53 1 1 CD 8951 18 54 - - HYPH 8951 18 55 0.86 0.86 CD 8951 18 56 ) ) -RRB- 8951 18 57 , , , 8951 18 58 or or CC 8951 18 59 8:14 8:14 CD 8951 18 60 . . . 8951 19 1 The the DT 8951 19 2 relative relative JJ 8951 19 3 entropies entropy NNS 8951 19 4 of of IN 8951 19 5 the the DT 8951 19 6 name name NN 8951 19 7 representations representation NNS 8951 19 8 provided provide VBN 8951 19 9 by by IN 8951 19 10 combining combine VBG 8951 19 11 these these DT 8951 19 12 key key NN 8951 19 13 - - HYPH 8951 19 14 sets set NNS 8951 19 15 with with IN 8951 19 16 keys key NNS 8951 19 17 for for IN 8951 19 18 the the DT 8951 19 19 first first JJ 8951 19 20 and and CC 8951 19 21 second second JJ 8951 19 22 initials initial NNS 8951 19 23 were be VBD 8951 19 24 de- de- RB 8951 19 25 termined termine VBN 8951 19 26 by by IN 8951 19 27 applying apply VBG 8951 19 28 them -PRON- PRP 8951 19 29 to to IN 8951 19 30 the the DT 8951 19 31 50,000 50,000 CD 8951 19 32 name name NN 8951 19 33 file file NN 8951 19 34 , , , 8951 19 35 and and CC 8951 19 36 the the DT 8951 19 37 entropy entropy JJ 8951 19 38 value value NN 8951 19 39 used use VBN 8951 19 40 to to TO 8951 19 41 determine determine VB 8951 19 42 the the DT 8951 19 43 optimal optimal JJ 8951 19 44 ratio ratio NN 8951 19 45 of of IN 8951 19 46 keys key NNS 8951 19 47 . . . 8951 20 1 In in IN 8951 20 2 one one CD 8951 20 3 case case NN 8951 20 4 , , , 8951 20 5 the the DT 8951 20 6 correlation correlation NN 8951 20 7 between between IN 8951 20 8 the the DT 8951 20 9 value value NN 8951 20 10 of of IN 8951 20 11 the the DT 8951 20 12 relative relative JJ 8951 20 13 entropy entropy JJ 8951 20 14 and and CC 8951 20 15 retrieval retrieval NN 8951 20 16 efficiency efficiency NN 8951 20 17 , , , 8951 20 18 as as IN 8951 20 19 mea- mea- NNS 8951 20 20 sured sure VBN 8951 20 21 by by IN 8951 20 22 the the DT 8951 20 23 precision precision NN 8951 20 24 ratio ratio NN 8951 20 25 , , , 8951 20 26 was be VBD 8951 20 27 also also RB 8951 20 28 studied study VBN 8951 20 29 , , , 8951 20 30 and and CC 8951 20 31 shown show VBN 8951 20 32 to to TO 8951 20 33 be be VB 8951 20 34 high high JJ 8951 20 35 . . . 8951 21 1 The the DT 8951 21 2 sizes size NNS 8951 21 3 of of IN 8951 21 4 the the DT 8951 21 5 combined combine VBN 8951 21 6 key key NN 8951 21 7 - - HYPH 8951 21 8 sets set NNS 8951 21 9 studied study VBN 8951 21 10 were be VBD 8951 21 11 148 148 CD 8951 21 12 and and CC 8951 21 13 296 296 CD 8951 21 14 , , , 8951 21 15 with with IN 8951 21 16 an an DT 8951 21 17 in- in- JJ 8951 21 18 termediate termediate NN 8951 21 19 set set NN 8951 21 20 of of IN 8951 21 21 254 254 CD 8951 21 22 keys key NNS 8951 21 23 . . . 8951 22 1 The the DT 8951 22 2 values value NNS 8951 22 3 of of IN 8951 22 4 148 148 CD 8951 22 5 and and CC 8951 22 6 296 296 CD 8951 22 7 were be VBD 8951 22 8 chosen choose VBN 8951 22 9 in in IN 8951 22 10 view view NN 8951 22 11 of of IN 8951 22 12 the the DT 8951 22 13 projected project VBN 8951 22 14 implementation implementation NN 8951 22 15 in in IN 8951 22 16 the the DT 8951 22 17 serial serial JJ 8951 22 18 - - HYPH 8951 22 19 parallel parallel NN 8951 22 20 file file NN 8951 22 21 organization.2 organization.2 UH 8951 22 22 This this DT 8951 22 23 relates relate VBZ 8951 22 24 the the DT 8951 22 25 size size NN 8951 22 26 of of IN 8951 22 27 the the DT 8951 22 28 key key NN 8951 22 29 - - HYPH 8951 22 30 set set NN 8951 22 31 to to IN 8951 22 32 the the DT 8951 22 33 number number NN 8951 22 34 of of IN 8951 22 35 blocks block NNS 8951 22 36 on on IN 8951 22 37 one one CD 8951 22 38 cylinder cylinder NN 8951 22 39 of of IN 8951 22 40 a a DT 8951 22 41 disc disc NN 8951 22 42 . . . 8951 23 1 ( ( -LRB- 8951 23 2 The the DT 8951 23 3 30Mbyte 30Mbyte NNP 8951 23 4 disc disc NN 8951 23 5 cartridges cartridge NNS 8951 23 6 available available JJ 8951 23 7 to to IN 8951 23 8 us -PRON- PRP 8951 23 9 have have VBP 8951 23 10 296 296 CD 8951 23 11 blocks block NNS 8951 23 12 per per IN 8951 23 13 cylinder cylinder NN 8951 23 14 . . . 8951 23 15 ) ) -RRB- 8951 24 1 Otherwise otherwise RB 8951 24 2 the the DT 8951 24 3 choice choice NN 8951 24 4 of of IN 8951 24 5 key key NN 8951 24 6 - - HYPH 8951 24 7 set set NN 8951 24 8 is be VBZ 8951 24 9 arbitrary arbitrary JJ 8951 24 10 , , , 8951 24 11 and and CC 8951 24 12 can can MD 8951 24 13 be be VB 8951 24 14 varied vary VBN 8951 24 15 at at IN 8951 24 16 will will NN 8951 24 17 . . . 8951 25 1 The the DT 8951 25 2 minimum minimum JJ 8951 25 3 key key JJ 8951 25 4 - - HYPH 8951 25 5 set set NN 8951 25 6 size size NN 8951 25 7 is be VBZ 8951 25 8 106 106 CD 8951 25 9 , , , 8951 25 10 consisting consist VBG 8951 25 11 of of IN 8951 25 12 26 26 CD 8951 25 13 letters letter NNS 8951 25 14 each each DT 8951 25 15 for for IN 8951 25 16 the the DT 8951 25 17 first first JJ 8951 25 18 and and CC 8951 25 19 last last JJ 8951 25 20 letter letter NN 8951 25 21 of of IN 8951 25 22 the the DT 8951 25 23 surname surname NN 8951 25 24 , , , 8951 25 25 and and CC 8951 25 26 27 27 CD 8951 25 27 ( ( -LRB- 8951 25 28 26 26 CD 8951 25 29 letters letter NNS 8951 25 30 and and CC 8951 25 31 the the DT 8951 25 32 space space NN 8951 25 33 sym- sym- IN 8951 25 34 bol bol NNP 8951 25 35 ) ) -RRB- 8951 25 36 each each DT 8951 25 37 for for IN 8951 25 38 the the DT 8951 25 39 first first JJ 8951 25 40 and and CC 8951 25 41 second second JJ 8951 25 42 initials initial NNS 8951 25 43 . . . 8951 26 1 The the DT 8951 26 2 numbers number NNS 8951 26 3 of of IN 8951 26 4 n n NN 8951 26 5 - - HYPH 8951 26 6 gram gram NN 8951 26 7 keys key NNS 8951 26 8 ( ( -LRB- 8951 26 9 n n NN 8951 26 10 : : : 8951 26 11 : : : 8951 26 12 : : : 8951 26 13 : : : 8951 26 14 , , , 8951 26 15 . . . 8951 27 1 2 2 LS 8951 27 2 ) ) -RRB- 8951 27 3 required require VBN 8951 27 4 for for IN 8951 27 5 the the DT 8951 27 6 key key JJ 8951 27 7 - - HYPH 8951 27 8 sets set NNS 8951 27 9 numbering number VBG 8951 27 10 148 148 CD 8951 27 11 , , , 8951 27 12 254 254 CD 8951 27 13 , , , 8951 27 14 and and CC 8951 27 15 296 296 CD 8951 27 16 in in IN 8951 27 17 size size NN 8951 27 18 are be VBP 8951 27 19 . . . 8951 28 1 thus thus RB 8951 28 2 42 42 CD 8951 28 3 , , , 8951 28 4 148 148 CD 8951 28 5 , , , 8951 28 6 and and CC 8951 28 7 190 190 CD 8951 28 8 . . . 8951 29 1 Full full JJ 8951 29 2 details detail NNS 8951 29 3 are be VBP 8951 29 4 given give VBN 8951 29 5 of of IN 8951 29 6 the the DT 8951 29 7 composition composition NN 8951 29 8 of of IN 8951 29 9 the the DT 8951 29 10 first first JJ 8951 29 11 and and CC 8951 29 12 third third JJ 8951 29 13 of of IN 8951 29 14 these these DT 8951 29 15 sets set NNS 8951 29 16 . . . 8951 30 1 A a DT 8951 30 2 slight slight JJ 8951 30 3 refinement refinement NN 8951 30 4 to to IN 8951 30 5 key key NN 8951 30 6 - - HYPH 8951 30 7 set set NN 8951 30 8 generation generation NN 8951 30 9 was be VBD 8951 30 10 employed employ VBN 8951 30 11 to to TO 8951 30 12 ensure ensure VB 8951 30 13 as as IN 8951 30 14 close close JJ 8951 30 15 an an DT 8951 30 16 approximation approximation NN 8951 30 17 to to IN 8951 30 18 equifrequency equifrequency VB 8951 30 19 as as IN 8951 30 20 possible possible JJ 8951 30 21 , , , 8951 30 22 especially especially RB 8951 30 23 with with IN 8951 30 24 the the DT 8951 30 25 small- small- JJ 8951 30 26 est est NNP 8951 30 27 key key NN 8951 30 28 - - HYPH 8951 30 29 sets set NNS 8951 30 30 . . . 8951 31 1 Precise precise JJ 8951 31 2 application application NN 8951 31 3 of of IN 8951 31 4 a a DT 8951 31 5 threshold threshold NN 8951 31 6 frequency frequency NN 8951 31 7 may may MD 8951 31 8 occasionally occasionally RB 8951 31 9 result result VB 8951 31 10 in in IN 8951 31 11 arbitrary arbitrary JJ 8951 31 12 inclusion inclusion NN 8951 31 13 of of IN 8951 31 14 either either DT 8951 31 15 very very RB 8951 31 16 high high JJ 8951 31 17 or or CC 8951 31 18 very very RB 8951 31 19 low low JJ 8951 31 20 frequency frequency NN 8951 31 21 keys key NNS 8951 31 22 . . . 8951 32 1 Thus thus RB 8951 32 2 , , , 8951 32 3 if if IN 8951 32 4 almost almost RB 8951 32 5 all all PDT 8951 32 6 the the DT 8951 32 7 occurrences occurrence NNS 8951 32 8 of of IN 8951 32 9 a a DT 8951 32 10 longer long JJR 8951 32 11 key key NN 8951 32 12 are be VBP 8951 32 13 accounted account VBN 8951 32 14 for for IN 8951 32 15 by by IN 8951 32 16 a a DT 8951 32 17 shorter short JJR 8951 32 18 key key NN 8951 32 19 ( ( -LRB- 8951 32 20 as as IN 8951 32 21 with with IN 8951 32 22 -MANN -mann NN 8951 32 23 and and CC 8951 32 24 -ANN -ANN . 8951 32 25 ) ) -RRB- 8951 32 26 , , , 8951 32 27 only only RB 8951 32 28 the the DT 8951 32 29 shorter short JJR 8951 32 30 n n NN 8951 32 31 - - HYPH 8951 32 32 gram gram NN 8951 32 33 is be VBZ 8951 32 34 included include VBN 8951 32 35 . . . 8951 33 1 Va1'iety Va1'iety NNP 8951 33 2 - - : 8951 33 3 Generato1 Generato1 NNP 8951 33 4 · · NFP 8951 33 5 Approach Approach NNP 8951 33 6 / / SYM 8951 33 7 FOKKER FOKKER NNP 8951 33 8 and and CC 8951 33 9 LYNCH LYNCH NNP 8951 33 10 203 203 CD 8951 33 11 OPTIMAL OPTIMAL NNP 8951 33 12 SET set NN 8951 33 13 OF of IN 8951 33 14 148 148 CD 8951 33 15 KEYS KEYS NNP 8951 33 16 The the DT 8951 33 17 number number NN 8951 33 18 of of IN 8951 33 19 n n NN 8951 33 20 - - HYPH 8951 33 21 gram gram NN 8951 33 22 keys key NNS 8951 33 23 ( ( -LRB- 8951 33 24 n n NN 8951 33 25 : : : 8951 33 26 : : : 8951 33 27 : : : 8951 33 28 : : : 8951 33 29 : : : 8951 33 30 : : : 8951 33 31 , , , 8951 33 32 . . . 8951 34 1 2 2 LS 8951 34 2 ) ) -RRB- 8951 34 3 to to TO 8951 34 4 be be VB 8951 34 5 added add VBN 8951 34 6 to to IN 8951 34 7 the the DT 8951 34 8 minimum minimum JJ 8951 34 9 set set NN 8951 34 10 of of IN 8951 34 11 106 106 CD 8951 34 12 keys key NNS 8951 34 13 is be VBZ 8951 34 14 42 42 CD 8951 34 15 , , , 8951 34 16 the the DT 8951 34 17 presumed presume VBN 8951 34 18 optimum optimum JJ 8951 34 19 proportion proportion NN 8951 34 20 being be VBG 8951 34 21 8:14 8:14 CD 8951 34 22 , , , 8951 34 23 which which WDT 8951 34 24 im- im- VBZ 8951 34 25 plies ply NNS 8951 34 26 about about IN 8951 34 27 16 16 CD 8951 34 28 keys key NNS 8951 34 29 from from IN 8951 34 30 the the DT 8951 34 31 front front NN 8951 34 32 of of IN 8951 34 33 the the DT 8951 34 34 name name NN 8951 34 35 and and CC 8951 34 36 26 26 CD 8951 34 37 from from IN 8951 34 38 the the DT 8951 34 39 back back NN 8951 34 40 . . . 8951 35 1 In in IN 8951 35 2 order order NN 8951 35 3 to to TO 8951 35 4 examine examine VB 8951 35 5 the the DT 8951 35 6 relationship relationship NN 8951 35 7 between between IN 8951 35 8 the the DT 8951 35 9 ratio ratio NN 8951 35 10 of of IN 8951 35 11 keys key NNS 8951 35 12 from from IN 8951 35 13 the the DT 8951 35 14 front front NN 8951 35 15 and and CC 8951 35 16 rear rear NN 8951 35 17 of of IN 8951 35 18 the the DT 8951 35 19 surname surname NN 8951 35 20 and and CC 8951 35 21 the the DT 8951 35 22 relative relative JJ 8951 35 23 entropy entropy RB 8951 35 24 of of IN 8951 35 25 the the DT 8951 35 26 combined combine VBN 8951 35 27 sets set NNS 8951 35 28 , , , 8951 35 29 the the DT 8951 35 30 ratios ratio NNS 8951 35 31 were be VBD 8951 35 32 varied vary VBN 8951 35 33 at at IN 8951 35 34 intervals interval NNS 8951 35 35 between between IN 8951 35 36 1:1 1:1 CD 8951 35 37 and and CC 8951 35 38 1:3 1:3 CD 8951 35 39 so so IN 8951 35 40 that that IN 8951 35 41 the the DT 8951 35 42 numbers number NNS 8951 35 43 of of IN 8951 35 44 n n NNP 8951 35 45 - - HYPH 8951 35 46 grams gram NNS 8951 35 47 varied vary VBD 8951 35 48 from from IN 8951 35 49 21 21 CD 8951 35 50 and and CC 8951 35 51 21 21 CD 8951 35 52 to to IN 8951 35 53 11 11 CD 8951 35 54 and and CC 8951 35 55 31 31 CD 8951 35 56 respectively respectively RB 8951 35 57 . . . 8951 36 1 For for IN 8951 36 2 each each DT 8951 36 3 ratio ratio NN 8951 36 4 the the DT 8951 36 5 keys key NNS 8951 36 6 were be VBD 8951 36 7 applied apply VBN 8951 36 8 to to IN 8951 36 9 the the DT 8951 36 10 50,000 50,000 CD 8951 36 11 name name NN 8951 36 12 entries entry NNS 8951 36 13 , , , 8951 36 14 and and CC 8951 36 15 the the DT 8951 36 16 distri- distri- JJ 8951 36 17 bution bution NN 8951 36 18 of of IN 8951 36 19 the the DT 8951 36 20 resultant resultant JJ 8951 36 21 descriptions description NNS 8951 36 22 determined determine VBN 8951 36 23 . . . 8951 37 1 The the DT 8951 37 2 ratios ratio NNS 8951 37 3 , , , 8951 37 4 the the DT 8951 37 5 number number NN 8951 37 6 of of IN 8951 37 7 n n NN 8951 37 8 - - HYPH 8951 37 9 gram gram NN 8951 37 10 keys key NNS 8951 37 11 , , , 8951 37 12 and and CC 8951 37 13 the the DT 8951 37 14 relative relative JJ 8951 37 15 entropies entropy NNS 8951 37 16 of of IN 8951 37 17 the the DT 8951 37 18 distributions distribution NNS 8951 37 19 are be VBP 8951 37 20 shown show VBN 8951 37 21 in in IN 8951 37 22 Table table NN 8951 37 23 1 1 CD 8951 37 24 . . . 8951 38 1 The the DT 8951 38 2 maximum maximum JJ 8951 38 3 value value NN 8951 38 4 of of IN 8951 38 5 the the DT 8951 38 6 entropy entropy JJ 8951 38 7 is be VBZ 8951 38 8 taken take VBN 8951 38 9 to to TO 8951 38 10 be be VB 8951 38 11 log250,000 log250,000 NNP 8951 38 12 . . . 8951 39 1 In in IN 8951 39 2 this this DT 8951 39 3 case case NN 8951 39 4 the the DT 8951 39 5 balancing balancing NN 8951 39 6 point point NN 8951 39 7 , , , 8951 39 8 with with IN 8951 39 9 the the DT 8951 39 10 key key NN 8951 39 11 - - HYPH 8951 39 12 set set NN 8951 39 13 including include VBG 8951 39 14 16 16 CD 8951 39 15 n n JJ 8951 39 16 - - HYPH 8951 39 17 gram gram NN 8951 39 18 keys key NNS 8951 39 19 Table table NN 8951 39 20 1 1 CD 8951 39 21 . . . 8951 40 1 Relation relation NN 8951 40 2 between between IN 8951 40 3 Ratio ratio NN 8951 40 4 of of IN 8951 40 5 n n NN 8951 40 6 - - HYPH 8951 40 7 grams gram NNS 8951 40 8 f1'0 f1'0 NNP 8951 40 9 m m NNP 8951 40 10 F1'Dnt F1'Dnt NNP 8951 40 11 and and CC 8951 40 12 Rear Rear NNP 8951 40 13 of of IN 8951 40 14 Surname Surname NNP 8951 40 15 , , , 8951 40 16 Entropy Entropy NNP 8951 40 17 of of IN 8951 40 18 Combined Combined NNP 8951 40 19 Key Key NNP 8951 40 20 - - HYPH 8951 40 21 Sets Sets NNPS 8951 40 22 , , , 8951 40 23 and and CC 8951 40 24 Retrieval Retrieval NNP 8951 40 25 Efficiency Efficiency NNP 8951 40 26 for for IN 8951 40 27 a a DT 8951 40 28 Series series NN 8951 40 29 of of IN 8951 40 30 Sets Sets NNPS 8951 40 31 of of IN 8951 40 32 148 148 CD 8951 40 33 Keys Keys NNPS 8951 40 34 Ratio Ratio NNP 8951 40 35 Numbm Numbm NNP 8951 40 36 · · NFP 8951 40 37 of of IN 8951 40 38 n n NN 8951 40 39 - - HYPH 8951 40 40 gram gram NN 8951 40 41 Number Number NNP 8951 40 42 of of IN 8951 40 43 Diffm·ent Diffm·ent NNP 8951 40 44 Relative relative JJ 8951 40 45 · · NFP 8951 40 46 Precision(% Precision(% NNPS 8951 40 47 ) ) -RRB- 8951 40 48 of of IN 8951 40 49 n n NN 8951 40 50 - - HYPH 8951 40 51 gram gram NN 8951 40 52 Keys Keys NNP 8951 40 53 Representations Representations NNPS 8951 40 54 Entropy Entropy NNP 8951 40 55 ( ( -LRB- 8951 40 56 File File NNP 8951 40 57 Size= size= NN 8951 40 58 Keys Keys NNP 8951 40 59 Front Front NNP 8951 40 60 Back back RB 8951 40 61 in in IN 8951 40 62 50,000 50,000 CD 8951 40 63 Entries entry NNS 8951 40 64 of of IN 8951 40 65 System System NNP 8951 40 66 25,000 25,000 CD 8951 40 67 ) ) -RRB- 8951 40 68 1:1 1:1 CD 8951 40 69 21 21 CD 8951 40 70 21 21 CD 8951 40 71 33,485 33,485 CD 8951 40 72 0.9450 0.9450 CD 8951 40 73 71.5 71.5 CD 8951 40 74 3:4 3:4 CD 8951 40 75 18 18 CD 8951 40 76 24 24 CD 8951 40 77 33,501 33,501 CD 8951 40 78 0.9450 0.9450 CD 8951 40 79 71.3 71.3 CD 8951 40 80 17:25 17:25 CD 8951 40 81 17 17 CD 8951 40 82 25 25 CD 8951 40 83 33,434 33,434 CD 8951 40 84 0.9447 0.9447 CD 8951 40 85 70.9 70.9 CD 8951 40 86 8:13 8:13 CD 8951 40 87 16 16 CD 8951 40 88 26 26 CD 8951 40 89 ' ' CD 8951 40 90 * * CD 8951 40 91 33,454 33,454 CD 8951 40 92 0.9453 0.9453 CD 8951 40 93 72.2 72.2 CD 8951 40 94 5:9 5:9 CD 8951 40 95 15 15 CD 8951 40 96 27 27 CD 8951 40 97 33,402 33,402 CD 8951 40 98 0.9450 0.9450 CD 8951 40 99 72.0 72.0 CD 8951 40 100 1:2 1:2 CD 8951 40 101 14 14 CD 8951 40 102 28 28 CD 8951 40 103 33,378 33,378 CD 8951 40 104 0.9449 0.9449 CD 8951 40 105 72.1 72.1 CD 8951 40 106 . . . 8951 41 1 1:3 1:3 CD 8951 41 2 11 11 CD 8951 41 3 31 31 CD 8951 41 4 33,126 33,126 CD 8951 41 5 0.9437 0.9437 CD 8951 41 6 71.5 71.5 CD 8951 41 7 Total total JJ 8951 41 8 number number NN 8951 41 9 of of IN 8951 41 10 different different JJ 8951 41 11 name name NN 8951 41 12 entries entry NNS 8951 41 13 = = SYM 8951 41 14 41,469 41,469 CD 8951 41 15 . . . 8951 42 1 ' ' `` 8951 42 2 * * NFP 8951 42 3 Key key NN 8951 42 4 - - HYPH 8951 42 5 set set VBN 8951 42 6 with with IN 8951 42 7 highest high JJS 8951 42 8 relative relative JJ 8951 42 9 entropy entropy NN 8951 42 10 . . . 8951 43 1 from from IN 8951 43 2 the the DT 8951 43 3 front front NN 8951 43 4 and and CC 8951 43 5 26 26 CD 8951 43 6 from from IN 8951 43 7 the the DT 8951 43 8 back back NN 8951 43 9 , , , 8951 43 10 corresponds correspond VBZ 8951 43 11 with with IN 8951 43 12 the the DT 8951 43 13 ratio ratio NN 8951 43 14 of of IN 8951 43 15 the the DT 8951 43 16 redundancies redundancy NNS 8951 43 17 of of IN 8951 43 18 the the DT 8951 43 19 first first JJ 8951 43 20 and and CC 8951 43 21 last last JJ 8951 43 22 letters letter NNS 8951 43 23 of of IN 8951 43 24 the the DT 8951 43 25 surnames surname NNS 8951 43 26 . . . 8951 44 1 Table table NN 8951 44 2 2 2 CD 8951 44 3 shows show VBZ 8951 44 4 the the DT 8951 44 5 composition composition NN 8951 44 6 of of IN 8951 44 7 the the DT 8951 44 8 optimal optimal JJ 8951 44 9 key key NN 8951 44 10 - - HYPH 8951 44 11 set set NN 8951 44 12 of of IN 8951 44 13 148 148 CD 8951 44 14 keys key NNS 8951 44 15 , , , 8951 44 16 while while IN 8951 44 17 Table table NN 8951 44 18 3 3 CD 8951 44 19 gives give VBZ 8951 44 20 the the DT 8951 44 21 distribution distribution NN 8951 44 22 of of IN 8951 44 23 the the DT 8951 44 24 name name NN 8951 44 25 representations representation NNS 8951 44 26 compiled compile VBD 8951 44 27 from from IN 8951 44 28 the the DT 8951 44 29 combined combine VBN 8951 44 30 key key NN 8951 44 31 - - HYPH 8951 44 32 set set NN 8951 44 33 , , , 8951 44 34 and and CC 8951 44 35 its -PRON- PRP$ 8951 44 36 corresponding corresponding JJ 8951 44 37 relative relative JJ 8951 44 38 entropy entropy NN 8951 44 39 . . . 8951 45 1 OPTIMAL OPTIMAL NNP 8951 45 2 SET set NN 8951 45 3 OF of IN 8951 45 4 296 296 CD 8951 45 5 KEYS KEYS NNP 8951 45 6 A a DT 8951 45 7 similar similar JJ 8951 45 8 procedure procedure NN 8951 45 9 to to IN 8951 45 10 that that DT 8951 45 11 used use VBN 8951 45 12 for for IN 8951 45 13 the the DT 8951 45 14 optimal148-key optimal148-key NNP 8951 45 15 key key NN 8951 45 16 - - HYPH 8951 45 17 set set NN 8951 45 18 was be VBD 8951 45 19 also also RB 8951 45 20 applied apply VBN 8951 45 21 in in IN 8951 45 22 this this DT 8951 45 23 instance instance NN 8951 45 24 . . . 8951 46 1 Here here RB 8951 46 2 the the DT 8951 46 3 ratios ratio NNS 8951 46 4 of of IN 8951 46 5 front front NN 8951 46 6 and and CC 8951 46 7 rear rear JJ 8951 46 8 n n JJ 8951 46 9 - - HYPH 8951 46 10 gram gram NN 8951 46 11 keys key NNS 8951 46 12 varied vary VBD 8951 46 13 from from IN 8951 46 14 57 57 CD 8951 46 15 and and CC 8951 46 16 133 133 CD 8951 46 17 to to IN 8951 46 18 69 69 CD 8951 46 19 and and CC 8951 46 20 121 121 CD 8951 46 21 respectively respectively RB 8951 46 22 . . . 8951 47 1 For for IN 8951 47 2 each each DT 8951 47 3 of of IN 8951 47 4 the the DT 8951 47 5 sets set NNS 8951 47 6 chosen choose VBN 8951 47 7 , , , 8951 47 8 the the DT 8951 47 9 distributions distribution NNS 8951 47 10 of of IN 8951 47 11 the the DT 8951 47 12 entries entry NNS 8951 47 13 resulting result VBG 8951 47 14 from from IN 8951 47 15 application application NN 8951 47 16 of of IN 8951 47 17 the the DT 8951 47 18 combined combine VBN 8951 47 19 key key NN 8951 47 20 - - HYPH 8951 47 21 sets set NNS 8951 47 22 to to IN 8951 47 23 the the DT 8951 47 24 file file NN 8951 47 25 of of IN 8951 47 26 50,000 50,000 CD 8951 47 27 names name NNS 8951 47 28 were be VBD 8951 47 29 determined determine VBN 8951 47 30 . . . 8951 48 1 These these DT 8951 48 2 showed show VBD 8951 48 3 virtually virtually RB 8951 48 4 no no DT 8951 48 5 difference difference NN 8951 48 6 in in IN 8951 48 7 terms term NNS 8951 48 8 of of IN 8951 48 9 the the DT 8951 48 10 relative relative JJ 8951 48 11 entropy entropy RB 8951 48 12 alone alone RB 8951 48 13 , , , 8951 48 14 al- al- XX 8951 48 15 though though IN 8951 48 16 the the DT 8951 48 17 total total JJ 8951 48 18 number number NN 8951 48 19 of of IN 8951 48 20 different different JJ 8951 48 21 entries entry NNS 8951 48 22 differed differ VBD 8951 48 23 slightly slightly RB 8951 48 24 between between IN 8951 48 25 key- key- VBG 8951 48 26 sets set NNS 8951 48 27 , , , 8951 48 28 and and CC 8951 48 29 the the DT 8951 48 30 highest high JJS 8951 48 31 value value NN 8951 48 32 was be VBD 8951 48 33 used use VBN 8951 48 34 to to TO 8951 48 35 choose choose VB 8951 48 36 the the DT 8951 48 37 optimal optimal JJ 8951 48 38 set set NN 8951 48 39 , , , 8951 48 40 detailed detail VBN 8951 48 41 in in IN 8951 48 42 Table table NN 8951 48 43 4 4 CD 8951 48 44 . . . 8951 49 1 The the DT 8951 49 2 range range NN 8951 49 3 of of IN 8951 49 4 combinations combination NNS 8951 49 5 studied study VBN 8951 49 6 is be VBZ 8951 49 7 shown show VBN 8951 49 8 in in IN 8951 49 9 Table Table NNP 8951 49 10 5 5 CD 8951 49 11 , , , 8951 49 12 and and CC 8951 49 13 the the DT 8951 49 14 distribution distribution NN 8951 49 15 of of IN 8951 49 16 the the DT 8951 49 17 entries entry NNS 8951 49 18 for for IN 8951 49 19 the the DT 8951 49 20 optimal optimal JJ 8951 49 21 set set NN 8951 49 22 is be VBZ 8951 49 23 given give VBN 8951 49 24 in in IN 8951 49 25 Table Table NNP 8951 49 26 6 6 CD 8951 49 27 . . . 8951 50 1 .. .. NFP 8951 50 2 , , , 8951 50 3 - - : 8951 50 4 : : : 8951 50 5 : : : 8951 50 6 : : : 8951 50 7 204 204 CD 8951 50 8 Journal Journal NNP 8951 50 9 of of IN 8951 50 10 Library Library NNP 8951 50 11 Automation Automation NNP 8951 50 12 Vol Vol NNP 8951 50 13 . . . 8951 51 1 7/3 7/3 CD 8951 51 2 September September NNP 8951 51 3 197 197 CD 8951 51 4 4 4 CD 8951 51 5 Table table NN 8951 51 6 2 2 CD 8951 51 7 . . . 8951 52 1 Composition composition NN 8951 52 2 of of IN 8951 52 3 Balanced Balanced NNP 8951 52 4 Key Key NNP 8951 52 5 - - HYPH 8951 52 6 Set Set NNP 8951 52 7 of of IN 8951 52 8 148 148 CD 8951 52 9 Keys Keys NNP 8951 52 10 Keys Keys NNP 8951 52 11 from from IN 8951 52 12 front front NN 8951 52 13 of of IN 8951 52 14 surname surname NN 8951 52 15 ( ( -LRB- 8951 52 16 42 42 CD 8951 52 17 ) ) -RRB- 8951 52 18 : : : 8951 52 19 Key key JJ 8951 52 20 P• p• CD 8951 52 21 Key key JJ 8951 52 22 P• p• CD 8951 52 23 Key key JJ 8951 52 24 P• p• CD 8951 52 25 Key key JJ 8951 52 26 P• p• CD 8951 52 27 A a NN 8951 52 28 .035 .035 CD 8951 52 29 G g NN 8951 52 30 .055 .055 CD 8951 52 31 MA MA NNP 8951 52 32 .030 .030 CD 8951 52 33 SH sh NN 8951 52 34 .016 .016 CD 8951 52 35 B b NN 8951 52 36 .020 .020 CD 8951 52 37 H h NN 8951 52 38 .035 .035 CD 8951 52 39 N n CD 8951 52 40 .025 .025 CD 8951 52 41 ST ST NNP 8951 52 42 .016 .016 CD 8951 52 43 BA BA NNP 8951 52 44 .020 .020 CD 8951 52 45 HA HA NNP 8951 52 46 .021 .021 CD 8951 52 47 0 0 CD 8951 52 48 .017 .017 CD 8951 52 49 T t NN 8951 52 50 .040 .040 CD 8951 52 51 BE be NN 8951 52 52 .017 .017 CD 8951 52 53 I i NN 8951 52 54 .013 .013 CD 8951 52 55 p p NN 8951 52 56 .038 .038 CD 8951 52 57 u u CD 8951 52 58 .005 .005 CD 8951 52 59 BO BO NNP 8951 52 60 .014 .014 CD 8951 52 61 J J NNP 8951 52 62 .017 .017 CD 8951 52 63 PA PA NNP 8951 52 64 .014 .014 CD 8951 52 65 v v NN 8951 52 66 .025 .025 CD 8951 52 67 BR BR NNP 8951 52 68 .014 .014 CD 8951 52 69 K k NN 8951 52 70 .041 .041 CD 8951 52 71 Q q NN 8951 52 72 .001 .001 CD 8951 52 73 w w NN 8951 52 74 .040 .040 CD 8951 52 75 c c NN 8951 52 76 .036 .036 CD 8951 52 77 KA ka NN 8951 52 78 .017 .017 CD 8951 52 79 R r NN 8951 52 80 .032 .032 CD 8951 52 81 X x NN 8951 52 82 CH CH NNP 8951 52 83 .016 .016 CD 8951 52 84 KO KO NNP 8951 52 85 .017 .017 CD 8951 52 86 RO ro NN 8951 52 87 .017 .017 CD 8951 52 88 y y NN 8951 52 89 .011 .011 CD 8951 52 90 D d NN 8951 52 91 .044 .044 CD 8951 52 92 L l NN 8951 52 93 .033 .033 CD 8951 52 94 s s NN 8951 52 95 .049 .049 CD 8951 52 96 z z NN 8951 52 97 .013 .013 CD 8951 52 98 E e NN 8951 52 99 .018 .018 CD 8951 52 100 LE LE NNP 8951 52 101 .014 .014 CD 8951 52 102 SA SA NNP 8951 52 103 , , , 8951 52 104 016 016 CD 8951 52 105 F f NN 8951 52 106 .034 .034 CD 8951 52 107 M m NN 8951 52 108 .050 .050 CD 8951 52 109 sc sc IN 8951 52 110 .015 .015 CD 8951 52 111 Keys key NNS 8951 52 112 from from IN 8951 52 113 rear rear NN 8951 52 114 of of IN 8951 52 115 surname surname NN 8951 52 116 ( ( -LRB- 8951 52 117 52 52 CD 8951 52 118 ) ) -RRB- 8951 52 119 : : : 8951 52 120 A a DT 8951 52 121 .060 .060 CD 8951 52 122 II ii CD 8951 52 123 .015 .015 CD 8951 52 124 NN NN NNP 8951 52 125 .010 .010 CD 8951 52 126 IS be VBZ 8951 52 127 .012 .012 CD 8951 52 128 RA RA NNP 8951 52 129 .010 .010 CD 8951 52 130 KI KI NNP 8951 52 131 .015 .015 CD 8951 52 132 ON on IN 8951 52 133 .018 .018 CD 8951 52 134 T t NN 8951 52 135 .042 .042 CD 8951 52 136 VA VA NNP 8951 52 137 .015 .015 CD 8951 52 138 J J NNP 8951 52 139 .001 .001 CD 8951 52 140 SON SON NNP 8951 52 141 .027 .027 CD 8951 52 142 u u NN 8951 52 143 .013 .013 CD 8951 52 144 B b NN 8951 52 145 .003 .003 CD 8951 52 146 K k NN 8951 52 147 .033 .033 CD 8951 52 148 0 0 CD 8951 52 149 .028 .028 CD 8951 52 150 v v NN 8951 52 151 .001 .001 CD 8951 52 152 c c NN 8951 52 153 .005 .005 CD 8951 52 154 L l NN 8951 52 155 .013 .013 CD 8951 52 156 KO KO NNP 8951 52 157 .013 .013 CD 8951 52 158 EV ev NN 8951 52 159 .018 .018 CD 8951 52 160 D d NN 8951 52 161 .030 .030 CD 8951 52 162 EL el NN 8951 52 163 .012 .012 CD 8951 52 164 p p NN 8951 52 165 .004 .004 CD 8951 52 166 ov ov IN 8951 52 167 .026 .026 CD 8951 52 168 E e NN 8951 52 169 .068 .068 CD 8951 52 170 LL ll NN 8951 52 171 .016 .016 CD 8951 52 172 Q q NN 8951 52 173 .001 .001 CD 8951 52 174 , , , 8951 52 175 KOV KOV NNP 8951 52 176 .012 .012 CD 8951 52 177 F f NN 8951 52 178 .006 .006 CD 8951 52 179 M m NN 8951 52 180 .013 .013 CD 8951 52 181 R r NN 8951 52 182 .016 .016 CD 8951 52 183 NOV NOV NNP 8951 52 184 .on .on NN 8951 52 185 G g NN 8951 52 186 .012 .012 CD 8951 52 187 N n CD 8951 52 188 .009 .009 CD 8951 52 189 ER er NN 8951 52 190 .064 .064 CD 8951 52 191 w w CD 8951 52 192 .005 .005 CD 8951 52 193 NG NG NNP 8951 52 194 .014 .014 CD 8951 52 195 AN an NN 8951 52 196 .020 .020 CD 8951 52 197 LER ler NN 8951 52 198 .013 .013 CD 8951 52 199 X x NN 8951 52 200 .003 .003 CD 8951 52 201 H h NN 8951 52 202 .020 .020 CD 8951 52 203 MAN man NN 8951 52 204 .017 .017 CD 8951 52 205 NER NER NNP 8951 52 206 .010 .010 CD 8951 52 207 y y NN 8951 52 208 .031 .031 CD 8951 52 209 CH CH NNP 8951 52 210 .017 .017 CD 8951 52 211 EN en NN 8951 52 212 .025 .025 CD 8951 52 213 s s NN 8951 52 214 .055 .055 CD 8951 52 215 EY ey NN 8951 52 216 .012 .012 CD 8951 52 217 I i NN 8951 52 218 .044 .044 CD 8951 52 219 IN in IN 8951 52 220 .039 .039 CD 8951 52 221 ES es NN 8951 52 222 .015 .015 CD 8951 52 223 z z NN 8951 52 224 .013 .013 CD 8951 52 225 Keys key NNS 8951 52 226 from from IN 8951 52 227 first first JJ 8951 52 228 initial initial JJ 8951 52 229 : : : 8951 52 230 27 27 CD 8951 52 231 characters character NNS 8951 52 232 Keys Keys NNPS 8951 52 233 from from IN 8951 52 234 second second JJ 8951 52 235 initial initial NN 8951 52 236 : : : 8951 52 237 27 27 CD 8951 52 238 characters character NNS 8951 52 239 Table table NN 8951 52 240 3 3 CD 8951 52 241 . . . 8951 53 1 Frequencies frequency NNS 8951 53 2 of of IN 8951 53 3 Entries entry NNS 8951 53 4 Represented represent VBN 8951 53 5 by by IN 8951 53 6 Optimall48-Key Optimall48-Key NNP 8951 53 7 Key Key NNP 8951 53 8 - - HYPH 8951 53 9 Set set NN 8951 53 10 in in IN 8951 53 11 a a DT 8951 53 12 File file NN 8951 53 13 of of IN 8951 53 14 50,000 50,000 CD 8951 53 15 Names Names NNPS 8951 53 16 Frequency Frequency NNP 8951 53 17 Number Number NNP 8951 53 18 of of IN 8951 53 19 Entries Entries NNPS 8951 53 20 with with IN 8951 53 21 f f NNP 8951 53 22 Frequencyf Frequencyf NNS 8951 53 23 1 1 CD 8951 53 24 24,363 24,363 CD 8951 53 25 2 2 CD 8951 53 26 5,622 5,622 CD 8951 53 27 3 3 CD 8951 53 28 1,850 1,850 CD 8951 53 29 4 4 CD 8951 53 30 757 757 CD 8951 53 31 5 5 CD 8951 53 32 372 372 CD 8951 53 33 6 6 CD 8951 53 34 193 193 CD 8951 53 35 7 7 CD 8951 53 36 103 103 CD 8951 53 37 8 8 CD 8951 53 38 68 68 CD 8951 53 39 9 9 CD 8951 53 40 32 32 CD 8951 53 41 10 10 CD 8951 53 42 24 24 CD 8951 53 43 11 11 CD 8951 53 44 - - SYM 8951 53 45 15 15 CD 8951 53 46 54 54 CD 8951 53 47 16 16 CD 8951 53 48 - - HYPH 8951 53 49 -20 -20 CD 8951 53 50 11 11 CD 8951 53 51 21 21 CD 8951 53 52 - - SYM 8951 53 53 30 30 CD 8951 53 54 4 4 CD 8951 53 55 33 33 CD 8951 53 56 1 1 CD 8951 53 57 Total total JJ 8951 53 58 number number NN 8951 53 59 of of IN 8951 53 60 different different JJ 8951 53 61 entries entry NNS 8951 53 62 = = SYM 8951 53 63 33,454 33,454 CD 8951 53 64 Maximum maximum JJ 8951 53 65 number number NN 8951 53 66 of of IN 8951 53 67 possible possible JJ 8951 53 68 combinations= combinations= NN 8951 53 69 1,592,136 1,592,136 CD 8951 53 70 ( ( -LRB- 8951 53 71 i.e. i.e. FW 8951 53 72 , , , 8951 53 73 42 42 CD 8951 53 74 x x SYM 8951 53 75 52 52 CD 8951 53 76 x x SYM 8951 53 77 27 27 CD 8951 53 78 " " '' 8951 53 79 ) ) -RRB- 8951 53 80 H h NN 8951 53 81 = = SYM 8951 53 82 14.7553 14.7553 CD 8951 53 83 Hmax Hmax NNP 8951 53 84 = = SYM 8951 53 85 15.6096{log,50,000 15.6096{log,50,000 CD 8951 53 86 ) ) -RRB- 8951 53 87 Hr Hr NNP 8951 53 88 = = NFP 8951 53 89 0.9453 0.9453 CD 8951 53 90 Variety Variety NNP 8951 53 91 - - HYPH 8951 53 92 Generator Generator NNP 8951 53 93 Approach Approach NNP 8951 53 94 / / SYM 8951 53 95 FOKKER FOKKER NNP 8951 53 96 and and CC 8951 53 97 LYNCH LYNCH NNP 8951 53 98 205 205 CD 8951 53 99 Table table NN 8951 53 100 4 4 CD 8951 53 101 . . . 8951 54 1 Composition composition NN 8951 54 2 of of IN 8951 54 3 Balanced Balanced NNP 8951 54 4 Key Key NNP 8951 54 5 - - HYPH 8951 54 6 Set Set NNP 8951 54 7 of of IN 8951 54 8 296 296 CD 8951 54 9 Keys Keys NNP 8951 54 10 Keys Keys NNP 8951 54 11 from from IN 8951 54 12 front front NN 8951 54 13 of of IN 8951 54 14 surname surname NN 8951 54 15 ( ( -LRB- 8951 54 16 87 87 CD 8951 54 17 ) ) -RRB- 8951 54 18 : : : 8951 54 19 A a DT 8951 54 20 BU BU NNP 8951 54 21 E E NNP 8951 54 22 HA HA NNP 8951 54 23 KI KI NNP 8951 54 24 MA MA NNP 8951 54 25 NI NI NNP 8951 54 26 RA RA NNP 8951 54 27 SI SI NNP 8951 54 28 WA WA NNP 8951 54 29 AL AL NNP 8951 54 30 c c NN 8951 54 31 F F NNP 8951 54 32 HE HE NNP 8951 54 33 KO KO NNP 8951 54 34 MAR MAR NNP 8951 54 35 0 0 CD 8951 54 36 RE RE NNP 8951 54 37 so so RB 8951 54 38 WE WE NNP 8951 54 39 AN an DT 8951 54 40 QA qa NN 8951 54 41 FR FR NNP 8951 54 42 HO HO NNP 8951 54 43 KR KR NNP 8951 54 44 MC MC NNP 8951 54 45 p p NNP 8951 54 46 RI RI NNP 8951 54 47 ST ST NNP 8951 54 48 WI WI NNP 8951 54 49 B B NNP 8951 54 50 CH CH NNP 8951 54 51 G G NNP 8951 54 52 HU HU NNP 8951 54 53 KU KU NNP 8951 54 54 ME ME NNP 8951 54 55 PA PA NNP 8951 54 56 RO ro NN 8951 54 57 T t NN 8951 54 58 X x NN 8951 54 59 BA BA NNP 8951 54 60 co co NN 8951 54 61 GA GA NNP 8951 54 62 I I NNP 8951 54 63 L L NNP 8951 54 64 MI MI NNP 8951 54 65 PE PE NNP 8951 54 66 s s POS 8951 54 67 TA ta NN 8951 54 68 y y NN 8951 54 69 BAR BAR NNP 8951 54 70 D D NNP 8951 54 71 GO GO NNP 8951 54 72 J J NNP 8951 54 73 LA LA NNP 8951 54 74 MO MO NNP 8951 54 75 PO PO NNP 8951 54 76 SA SA NNP 8951 54 77 u u NNP 8951 54 78 z z NNP 8951 54 79 BE BE NNP 8951 54 80 DA DA NNP 8951 54 81 GR GR NNP 8951 54 82 JO JO NNP 8951 54 83 LE LE NNP 8951 54 84 MU MU NNP 8951 54 85 PR pr RB 8951 54 86 sc sc RB 8951 54 87 v v IN 8951 54 88 BO BO NNP 8951 54 89 DE DE NNP 8951 54 90 GU GU NNP 8951 54 91 K K NNP 8951 54 92 Ll Ll NNP 8951 54 93 N N NNP 8951 54 94 Q Q NNP 8951 54 95 SE SE NNP 8951 54 96 · · NFP 8951 54 97 VA VA NNP 8951 54 98 BR BR NNP 8951 54 99 DO DO NNP 8951 54 100 H h NN 8951 54 101 KA ka NN 8951 54 102 M m NN 8951 54 103 NA NA NNP 8951 54 104 R R NNP 8951 54 105 SH SH NNP 8951 54 106 w w NNP 8951 54 107 Keys Keys NNP 8951 54 108 from from IN 8951 54 109 rear rear NN 8951 54 110 of of IN 8951 54 111 surname surname NN 8951 54 112 ( ( -LRB- 8951 54 113 155 155 CD 8951 54 114 ) ) -RRB- 8951 54 115 : : : 8951 54 116 A a DT 8951 54 117 LD LD NNP 8951 54 118 NG NG NNP 8951 54 119 VSKII vskii NN 8951 54 120 EL el NN 8951 54 121 LIN LIN NNP 8951 54 122 R R NNP 8951 54 123 OR or CC 8951 54 124 NT NT NNP 8951 54 125 sov sov NN 8951 54 126 CA CA NNP 8951 54 127 ND ND NNP 8951 54 128 ANG ANG NNP 8951 54 129 KI KI NNP 8951 54 130 LL LL NNP 8951 54 131 TIN TIN NNP 8951 54 132 AR AR NNP 8951 54 133 s s NN 8951 54 134 RT RT NNP 8951 54 135 w w NNP 8951 54 136 DA DA NNP 8951 54 137 RD RD NNP 8951 54 138 lNG lng NN 8951 54 139 SKI SKI NNP 8951 54 140 ALL ALL NNP 8951 54 141 NN NN NNP 8951 54 142 ER ER NNP 8951 54 143 AS as IN 8951 54 144 ERT ert NN 8951 54 145 X X NNP 8951 54 146 KA KA NNP 8951 54 147 E E NNP 8951 54 148 RG RG NNP 8951 54 149 WSKI WSKI NNP 8951 54 150 ELL ELL NNP 8951 54 151 ON on IN 8951 54 152 BER BER NNP 8951 54 153 ES ES NNP 8951 54 154 ST ST NNP 8951 54 155 y y NNP 8951 54 156 MA MA NNP 8951 54 157 DE DE NNP 8951 54 158 H H NNP 8951 54 159 LI LI NNP 8951 54 160 M M NNP 8951 54 161 SON SON NNP 8951 54 162 DER DER NNP 8951 54 163 NES NES NNP 8951 54 164 TT TT NNP 8951 54 165 AY AY NNP 8951 54 166 NA NA NNP 8951 54 167 EE EE NNP 8951 54 168 CH CH NNP 8951 54 169 NI NI NNP 8951 54 170 AM AM NNP 8951 54 171 LSON lson NN 8951 54 172 GER GER NNP 8951 54 173 IS be VBZ 8951 54 174 ETT ETT NNP 8951 54 175 EY EY NNP 8951 54 176 INA INA NNP 8951 54 177 GE GE NNP 8951 54 178 ICH ICH NNP 8951 54 179 RI RI NNP 8951 54 180 N N NNP 8951 54 181 NSON NSON NNP 8951 54 182 NGER NGER NNP 8951 54 183 NS NS NNP 8951 54 184 u u NNP 8951 54 185 LEY LEY NNP 8951 54 186 RA RA NNP 8951 54 187 KE KE NNP 8951 54 188 VICH vich NN 8951 54 189 TI TI NNP 8951 54 190 AN an DT 8951 54 191 RSON rson NN 8951 54 192 HER her PRP$ 8951 54 193 INS INS NNP 8951 54 194 v v NN 8951 54 195 KY KY NNP 8951 54 196 TA TA NNP 8951 54 197 LE LE NNP 8951 54 198 GH GH NNP 8951 54 199 J J NNP 8951 54 200 MAN MAN NNP 8951 54 201 TON TON NNP 8951 54 202 IER IER NNP 8951 54 203 OS os NN 8951 54 204 EV EV NNP 8951 54 205 RY RY NNP 8951 54 206 VA VA NNP 8951 54 207 NE NE NNP 8951 54 208 SH SH NNP 8951 54 209 K K NNP 8951 54 210 RMAN RMAN NNP 8951 54 211 0 0 CD 8951 54 212 KER KER NNP 8951 54 213 RS RS NNP 8951 54 214 ov ov IN 8951 54 215 z z NNP 8951 54 216 OVA OVA NNP 8951 54 217 RE RE NNP 8951 54 218 TH th NN 8951 54 219 AK ak NN 8951 54 220 YAN YAN NNP 8951 54 221 KO KO NNP 8951 54 222 LER LER NNP 8951 54 223 ss ss NNP 8951 54 224 KOV KOV NNP 8951 54 225 TZ TZ NNP 8951 54 226 WA WA NNP 8951 54 227 SE SE NNP 8951 54 228 ITH ITH NNP 8951 54 229 CK CK NNP 8951 54 230 EN EN NNP 8951 54 231 NKO NKO NNP 8951 54 232 LLER LLER NNP 8951 54 233 TS TS NNP 8951 54 234 IKOV IKOV NNP 8951 54 235 YA YA NNP 8951 54 236 TE TE NNP 8951 54 237 I i NN 8951 54 238 EK EK NNP 8951 54 239 SEN SEN NNP 8951 54 240 NO no DT 8951 54 241 MER MER NNP 8951 54 242 us -PRON- PRP 8951 54 243 LOV LOV NNP 8951 54 244 B B NNP 8951 54 245 F F NNP 8951 54 246 AI AI NNP 8951 54 247 IK IK NNP 8951 54 248 IN in IN 8951 54 249 TO to IN 8951 54 250 NER NER NNP 8951 54 251 T T NNP 8951 54 252 NOV NOV NNP 8951 54 253 c c NNP 8951 54 254 FF FF NNP 8951 54 255 HI HI NNP 8951 54 256 L L NNP 8951 54 257 EIN EIN NNP 8951 54 258 p p NN 8951 54 259 SER SER NNP 8951 54 260 DT DT NNP 8951 54 261 ANOV anov NN 8951 54 262 D D NNP 8951 54 263 G g NN 8951 54 264 II II NNP 8951 54 265 AL AL NNP 8951 54 266 KIN KIN NNP 8951 54 267 Q Q NNP 8951 54 268 TER TER NNP 8951 54 269 ET ET NNP 8951 54 270 ROV ROV NNP 8951 54 271 Keys Keys NNPS 8951 54 272 from from IN 8951 54 273 first first JJ 8951 54 274 initial initial NN 8951 54 275 : : : 8951 54 276 27 27 CD 8951 54 277 characters character NNS 8951 54 278 Keys Keys NNPS 8951 54 279 from from IN 8951 54 280 second second JJ 8951 54 281 initial initial NN 8951 54 282 : : : 8951 54 283 27 27 CD 8951 54 284 characters character NNS 8951 54 285 Table table NN 8951 54 286 5 5 CD 8951 54 287 . . . 8951 55 1 Relation relation NN 8951 55 2 between between IN 8951 55 3 Ratio ratio NN 8951 55 4 of of IN 8951 55 5 n n NN 8951 55 6 - - HYPH 8951 55 7 grams gram NNS 8951 55 8 from from IN 8951 55 9 Front Front NNP 8951 55 10 and and CC 8951 55 11 Rear Rear NNP 8951 55 12 of of IN 8951 55 13 Surname Surname NNP 8951 55 14 and and CC 8951 55 15 Entropy Entropy NNP 8951 55 16 of of IN 8951 55 17 Combined Combined NNP 8951 55 18 Key Key NNP 8951 55 19 - - HYPH 8951 55 20 Sets Sets NNPS 8951 55 21 for for IN 8951 55 22 a a DT 8951 55 23 Series series NN 8951 55 24 of of IN 8951 55 25 Sets Sets NNPS 8951 55 26 of of IN 8951 55 27 296 296 CD 8951 55 28 Keys Keys NNPS 8951 55 29 ( ( -LRB- 8951 55 30 File File NNP 8951 55 31 Size= size= NN 8951 55 32 50,000 50,000 CD 8951 55 33 ) ) -RRB- 8951 55 34 Ratio ratio NN 8951 55 35 ofn ofn NN 8951 55 36 - - HYPH 8951 55 37 gram gram NN 8951 55 38 Keys key NNS 8951 55 39 3:7 3:7 NFP 8951 55 40 61:129 61:129 CD 8951 55 41 13:25 13:25 CD 8951 55 42 69:121 69:121 CD 8951 55 43 Number Number NNP 8951 55 44 of of IN 8951 55 45 n n NN 8951 55 46 - - HYPH 8951 55 47 gram gram NN 8951 55 48 Keys Keys NNP 8951 55 49 Front Front NNP 8951 55 50 57 57 CD 8951 55 51 61 61 CD 8951 55 52 65 65 CD 8951 55 53 69 69 CD 8951 55 54 Back back RB 8951 55 55 133 133 CD 8951 55 56 129 129 CD 8951 55 57 ' ' NN 8951 55 58 * * NFP 8951 55 59 125 125 CD 8951 55 60 121 121 CD 8951 55 61 ' ' CD 8951 55 62 * * NFP 8951 55 63 Key key NN 8951 55 64 - - HYPH 8951 55 65 set set VBN 8951 55 66 with with IN 8951 55 67 highest high JJS 8951 55 68 number number NN 8951 55 69 of of IN 8951 55 70 different different JJ 8951 55 71 entries entry NNS 8951 55 72 . . . 8951 56 1 Number number NN 8951 56 2 of of IN 8951 56 3 Different Different NNP 8951 56 4 Representations Representations NNPS 8951 56 5 39,182 39,182 CD 8951 56 6 39,191 39,191 CD 8951 56 7 39,186 39,186 CD 8951 56 8 39,179 39,179 CD 8951 56 9 Relative relative JJ 8951 56 10 Entropy Entropy NNP 8951 56 11 of of IN 8951 56 12 System System NNP 8951 56 13 0.9679 0.9679 CD 8951 56 14 0.9679 0.9679 CD 8951 56 15 0.9679 0.9679 CD 8951 56 16 0.9679 0.9679 CD 8951 56 17 In in IN 8951 56 18 this this DT 8951 56 19 instance instance NN 8951 56 20 , , , 8951 56 21 the the DT 8951 56 22 ratio ratio NN 8951 56 23 of of IN 8951 56 24 n n NN 8951 56 25 - - HYPH 8951 56 26 gram gram NN 8951 56 27 keys key NNS 8951 56 28 from from IN 8951 56 29 the the DT 8951 56 30 front front NN 8951 56 31 and and CC 8951 56 32 back back NN 8951 56 33 of of IN 8951 56 34 the the DT 8951 56 35 surnames surname NNS 8951 56 36 has have VBZ 8951 56 37 been be VBN 8951 56 38 displaced displace VBN 8951 56 39 from from IN 8951 56 40 the the DT 8951 56 41 ratio ratio NN 8951 56 42 of of IN 8951 56 43 the the DT 8951 56 44 redundancies redundancy NNS 8951 56 45 of of IN 8951 56 46 the the DT 8951 56 47 first first JJ 8951 56 48 and and CC 8951 56 49 last last JJ 8951 56 50 characters character NNS 8951 56 51 of of IN 8951 56 52 the the DT 8951 56 53 surnames surname NNS 8951 56 54 , , , 8951 56 55 i.e. i.e. FW 8951 56 56 , , , 8951 56 57 8:14 8:14 CD 8951 56 58 ( ( -LRB- 8951 56 59 1:1.7 1:1.7 CD 8951 56 60 ) ) -RRB- 8951 56 61 . . . 8951 57 1 Here here RB 8951 57 2 the the DT 8951 57 3 ratio ratio NN 8951 57 4 is be VBZ 8951 57 5 roughly roughly RB 8951 57 6 1:2 1:2 CD 8951 57 7 . . . 8951 58 1 This this DT 8951 58 2 is be VBZ 8951 58 3 undoubtedly undoubtedly RB 8951 58 4 due due IN 8951 58 5 to to IN 8951 58 6 the the DT 8951 58 7 fact fact NN 8951 58 8 that that IN 8951 58 9 the the DT 8951 58 10 relative relative JJ 8951 58 11 entro- entro- NN 8951 58 12 pies pie NNS 8951 58 13 of of IN 8951 58 14 key key NN 8951 58 15 - - HYPH 8951 58 16 sets set NNS 8951 58 17 from from IN 8951 58 18 the the DT 8951 58 19 back back NN 8951 58 20 of of IN 8951 58 21 the the DT 8951 58 22 surname surname NN 8951 58 23 increase increase NN 8951 58 24 less less RBR 8951 58 25 rapidly rapidly RB 8951 58 26 than than IN 8951 58 27 those those DT 8951 58 28 of of IN 8951 58 29 key key NN 8951 58 30 - - HYPH 8951 58 31 sets set NNS 8951 58 32 from from IN 8951 58 33 the the DT 8951 58 34 front front NN 8951 58 35 , , , 8951 58 36 and and CC 8951 58 37 hence hence RB 8951 58 38 larger large JJR 8951 58 39 sets set NNS 8951 58 40 must must MD 8951 58 41 be be VB 8951 58 42 employed employ VBN 8951 58 43 . . . 8951 59 1 EVALUATION evaluation NN 8951 59 2 OF of IN 8951 59 3 RETRIEVAL RETRIEVAL NNS 8951 59 4 EFFECTIVENESS effectiveness NN 8951 59 5 The the DT 8951 59 6 keys key NNS 8951 59 7 in in IN 8951 59 8 the the DT 8951 59 9 optimized optimize VBN 8951 59 10 key key JJ 8951 59 11 - - HYPH 8951 59 12 sets set NNS 8951 59 13 represent represent VBP 8951 59 14 name name NN 8951 59 15 entries entry NNS 8951 59 16 in in IN 8951 59 17 an an DT 8951 59 18 approxi- approxi- NN 8951 59 19 , , , 8951 59 20 , , , 8951 59 21 I -PRON- PRP 8951 59 22 ' ' `` 8951 59 23 i i PRP 8951 59 24 : : : 8951 59 25 206 206 CD 8951 59 26 ] ] -RRB- 8951 59 27 oumal oumal JJ 8951 59 28 of of IN 8951 59 29 Librm·y Librm·y NNP 8951 59 30 Automation Automation NNP 8951 59 31 Vol Vol NNP 8951 59 32 . . . 8951 60 1 7 7 CD 8951 60 2 I i NN 8951 60 3 3 3 CD 8951 60 4 September September NNP 8951 60 5 197 197 CD 8951 60 6 4 4 CD 8951 60 7 Table table NN 8951 60 8 6 6 CD 8951 60 9 . . . 8951 61 1 Frequencies frequency NNS 8951 61 2 of of IN 8951 61 3 Entries entry NNS 8951 61 4 Represented represent VBN 8951 61 5 by by IN 8951 61 6 Optimal Optimal NNP 8951 61 7 Key Key NNP 8951 61 8 - - HYPH 8951 61 9 Set set NN 8951 61 10 of of IN 8951 61 11 296 296 CD 8951 61 12 Keys Keys NNPS 8951 61 13 in in IN 8951 61 14 a a DT 8951 61 15 File file NN 8951 61 16 of of IN 8951 61 17 50,000 50,000 CD 8951 61 18 Names Names NNPS 8951 61 19 Frequency Frequency NNP 8951 61 20 f f NNP 8951 61 21 1 1 CD 8951 61 22 2 2 CD 8951 61 23 3 3 CD 8951 61 24 4 4 CD 8951 61 25 5 5 CD 8951 61 26 6 6 CD 8951 61 27 7 7 CD 8951 61 28 8 8 CD 8951 61 29 9 9 CD 8951 61 30 10 10 CD 8951 61 31 11 11 CD 8951 61 32 12 12 CD 8951 61 33 13 13 CD 8951 61 34 14 14 CD 8951 61 35 15 15 CD 8951 61 36 16 16 CD 8951 61 37 Total total JJ 8951 61 38 number number NN 8951 61 39 of of IN 8951 61 40 different different JJ 8951 61 41 entries entry NNS 8951 61 42 = = SYM 8951 61 43 39,191 39,191 CD 8951 61 44 Number Number NNP 8951 61 45 of of IN 8951 61 46 Entries Entries NNPS 8951 61 47 with with IN 8951 61 48 Frequencyf Frequencyf NNP 8951 61 49 31,705 31,705 CD 8951 61 50 5,394 5,394 CD 8951 61 51 1,371 1,371 CD 8951 61 52 442 442 CD 8951 61 53 164 164 CD 8951 61 54 63 63 CD 8951 61 55 27 27 CD 8951 61 56 12 12 CD 8951 61 57 4 4 CD 8951 61 58 3 3 CD 8951 61 59 2 2 CD 8951 61 60 2 2 CD 8951 61 61 1 1 CD 8951 61 62 1 1 CD 8951 61 63 Maximum maximum JJ 8951 61 64 number number NN 8951 61 65 of of IN 8951 61 66 possible possible JJ 8951 61 67 combinations= combinations= NN 8951 61 68 9,830,565 9,830,565 CD 8951 61 69 ( ( -LRB- 8951 61 70 i.e. i.e. FW 8951 61 71 , , , 8951 61 72 87 87 CD 8951 61 73 X x NN 8951 61 74 155 155 CD 8951 61 75 x x SYM 8951 61 76 27 27 CD 8951 61 77 ' ' POS 8951 61 78 ) ) -RRB- 8951 61 79 H h NN 8951 61 80 = = NFP 8951 61 81 15.108 15.108 CD 8951 61 82 Hmax Hmax NNP 8951 61 83 = = SYM 8951 61 84 15.6096(log,50,000 15.6096(log,50,000 CD 8951 61 85 ) ) -RRB- 8951 61 86 Hr Hr NNP 8951 61 87 = = SYM 8951 61 88 0.9679 0.9679 CD 8951 61 89 mate mate NN 8951 61 90 manner manner NN 8951 61 91 only only RB 8951 61 92 , , , 8951 61 93 so so IN 8951 61 94 that that IN 8951 61 95 when when WRB 8951 61 96 a a DT 8951 61 97 search search NN 8951 61 98 for for IN 8951 61 99 a a DT 8951 61 100 name name NN 8951 61 101 is be VBZ 8951 61 102 performed perform VBN 8951 61 103 , , , 8951 61 104 addi- addi- NNP 8951 61 105 tional tional JJ 8951 61 106 entries entry NNS 8951 61 107 represented represent VBN 8951 61 108 by by IN 8951 61 109 the the DT 8951 61 110 same same JJ 8951 61 111 combination combination NN 8951 61 112 of of IN 8951 61 113 keys key NNS 8951 61 114 are be VBP 8951 61 115 identified identify VBN 8951 61 116 . . . 8951 62 1 While while IN 8951 62 2 these these DT 8951 62 3 may may MD 8951 62 4 be be VB 8951 62 5 eliminated eliminate VBN 8951 62 6 in in IN 8951 62 7 a a DT 8951 62 8 subsequent subsequent JJ 8951 62 9 character character NN 8951 62 10 - - HYPH 8951 62 11 by by IN 8951 62 12 - - HYPH 8951 62 13 character character NN 8951 62 14 match match NN 8951 62 15 of of IN 8951 62 16 the the DT 8951 62 17 candidate candidate NN 8951 62 18 hits hit VBZ 8951 62 19 , , , 8951 62 20 the the DT 8951 62 21 proportion proportion NN 8951 62 22 of of IN 8951 62 23 unwanted unwanted JJ 8951 62 24 items item NNS 8951 62 25 should should MD 8951 62 26 re- re- VB 8951 62 27 main main JJ 8951 62 28 low low NN 8951 62 29 if if IN 8951 62 30 the the DT 8951 62 31 method method NN 8951 62 32 is be VBZ 8951 62 33 to to TO 8951 62 34 offer offer VB 8951 62 35 advantages advantage NNS 8951 62 36 . . . 8951 63 1 In in IN 8951 63 2 evaluating evaluate VBG 8951 63 3 the the DT 8951 63 4 effectiveness effectiveness NN 8951 63 5 of of IN 8951 63 6 the the DT 8951 63 7 key key NN 8951 63 8 - - HYPH 8951 63 9 sets set NNS 8951 63 10 in in IN 8951 63 11 the the DT 8951 63 12 retrieval retrieval NN 8951 63 13 , , , 8951 63 14 the the DT 8951 63 15 names name NNS 8951 63 16 in in IN 8951 63 17 the the DT 8951 63 18 search search NN 8951 63 19 file file NN 8951 63 20 were be VBD 8951 63 21 represented represent VBN 8951 63 22 by by IN 8951 63 23 concatenating concatenate VBG 8951 63 24 the the DT 8951 63 25 codes code NNS 8951 63 26 for for IN 8951 63 27 the the DT 8951 63 28 keys key NNS 8951 63 29 from from IN 8951 63 30 the the DT 8951 63 31 front front NN 8951 63 32 and and CC 8951 63 33 back back NN 8951 63 34 of of IN 8951 63 35 the the DT 8951 63 36 surnames surname NNS 8951 63 37 and and CC 8951 63 38 the the DT 8951 63 39 initials initial NNS 8951 63 40 , , , 8951 63 41 and and CC 8951 63 42 subjecting subject VBG 8951 63 43 the the DT 8951 63 44 query query NN 8951 63 45 names name NNS 8951 63 46 to to IN 8951 63 47 the the DT 8951 63 48 same same JJ 8951 63 49 procedure procedure NN 8951 63 50 . . . 8951 64 1 The the DT 8951 64 2 matching match VBG 8951 64 3 procedure procedure NN 8951 64 4 produced produce VBD 8951 64 5 lists list NNS 8951 64 6 of of IN 8951 64 7 candidate candidate NN 8951 64 8 entries entry NNS 8951 64 9 , , , 8951 64 10 of of IN 8951 64 11 which which WDT 8951 64 12 the the DT 8951 64 13 desired desire VBN 8951 64 14 entries entry NNS 8951 64 15 were be VBD 8951 64 16 a a DT 8951 64 17 subset subset NN 8951 64 18 . . . 8951 65 1 The the DT 8951 65 2 final final JJ 8951 65 3 determination determination NN 8951 65 4 was be VBD 8951 65 5 carried carry VBN 8951 65 6 out out RP 8951 65 7 manually manually RB 8951 65 8 . . . 8951 66 1 The the DT 8951 66 2 tests test NNS 8951 66 3 were be VBD 8951 66 4 performed perform VBN 8951 66 5 first first RB 8951 66 6 with with IN 8951 66 7 names name NNS 8951 66 8 sampled sample VBN 8951 66 9 from from IN 8951 66 10 the the DT 8951 66 11 search search NN 8951 66 12 file file NN 8951 66 13 , , , 8951 66 14 so so IN 8951 66 15 that that IN 8951 66 16 correct correct JJ 8951 66 17 items item NNS 8951 66 18 were be VBD 8951 66 19 retrieved retrieve VBN 8951 66 20 for for IN 8951 66 21 each each DT 8951 66 22 query query NN 8951 66 23 . . . 8951 67 1 Since since IN 8951 67 2 searches search NNS 8951 67 3 for for IN 8951 67 4 name name NN 8951 67 5 entries entry NNS 8951 67 6 may may MD 8951 67 7 be be VB 8951 67 8 performed perform VBN 8951 67 9 with with IN 8951 67 10 varying vary VBG 8951 67 11 probabilities probability NNS 8951 67 12 that that WDT 8951 67 13 the the DT 8951 67 14 authors author NNS 8951 67 15 ' ' POS 8951 67 16 names name NNS 8951 67 17 are be VBP 8951 67 18 present present JJ 8951 67 19 in in IN 8951 67 20 the the DT 8951 67 21 file file NN 8951 67 22 ( ( -LRB- 8951 67 23 especially especially RB 8951 67 24 in in IN 8951 67 25 current current JJ 8951 67 26 - - HYPH 8951 67 27 awareness awareness NN 8951 67 28 searches search NNS 8951 67 29 ) ) -RRB- 8951 67 30 , , , 8951 67 31 varying vary VBG 8951 67 32 proportions proportion NNS 8951 67 33 of of IN 8951 67 34 names name NNS 8951 67 35 of of IN 8951 67 36 the the DT 8951 67 37 same same JJ 8951 67 38 provenance provenance NN 8951 67 39 , , , 8951 67 40 but but CC 8951 67 41 known know VBN 8951 67 42 not not RB 8951 67 43 to to TO 8951 67 44 be be VB 8951 67 45 present present JJ 8951 67 46 in in IN 8951 67 47 the the DT 8951 67 48 search search NN 8951 67 49 file file NN 8951 67 50 , , , 8951 67 51 were be VBD 8951 67 52 also also RB 8951 67 53 added add VBN 8951 67 54 . . . 8951 68 1 In in IN 8951 68 2 these these DT 8951 68 3 cases case NNS 8951 68 4 candidate candidate VBP 8951 68 5 items item NNS 8951 68 6 were be VBD 8951 68 7 selected select VBN 8951 68 8 which which WDT 8951 68 9 included include VBD 8951 68 10 none none NN 8951 68 11 of of IN 8951 68 12 the the DT 8951 68 13 desired desire VBN 8951 68 14 entries entry NNS 8951 68 15 . . . 8951 69 1 Recall recall NN 8951 69 2 tests test NNS 8951 69 3 were be VBD 8951 69 4 also also RB 8951 69 5 performed perform VBN 8951 69 6 and and CC 8951 69 7 recall recall VB 8951 69 8 shown show VBN 8951 69 9 to to TO 8951 69 10 be be VB 8951 69 11 complete complete JJ 8951 69 12 . . . 8951 70 1 The the DT 8951 70 2 measure measure NN 8951 70 3 used use VBN 8951 70 4 in in IN 8951 70 5 determining determine VBG 8951 70 6 the the DT 8951 70 7 performance performance NN 8951 70 8 of of IN 8951 70 9 the the DT 8951 70 10 variety variety NN 8951 70 11 - - HYPH 8951 70 12 gen- gen- NN 8951 70 13 erator erator NN 8951 70 14 search search NN 8951 70 15 method method NN 8951 70 16 is be VBZ 8951 70 17 the the DT 8951 70 18 precision precision NN 8951 70 19 ratio ratio NN 8951 70 20 , , , 8951 70 21 defined define VBN 8951 70 22 as as IN 8951 70 23 the the DT 8951 70 24 ratio ratio NN 8951 70 25 of of IN 8951 70 26 correctly correctly RB 8951 70 27 identified identify VBN 8951 70 28 names name NNS 8951 70 29 to to IN 8951 70 30 all all DT 8951 70 31 names name NNS 8951 70 32 retrieved retrieve VBN 8951 70 33 . . . 8951 71 1 It -PRON- PRP 8951 71 2 is be VBZ 8951 71 3 presented present VBN 8951 71 4 both both DT 8951 71 5 as as IN 8951 71 6 the the DT 8951 71 7 ratio ratio NN 8951 71 8 of of IN 8951 71 9 averages average NNS 8951 71 10 ( ( -LRB- 8951 71 11 i.e. i.e. FW 8951 71 12 , , , 8951 71 13 the the DT 8951 71 14 summation summation NN 8951 71 15 of of IN 8951 71 16 items item NNS 8951 71 17 retrieved retrieve VBN 8951 71 18 in in IN 8951 71 19 the the DT 8951 71 20 search search NN 8951 71 21 and and CC 8951 71 22 cal- cal- NN 8951 71 23 culation culation NN 8951 71 24 of of IN 8951 71 25 the the DT 8951 71 26 average average NN 8951 71 27 ) ) -RRB- 8951 71 28 and and CC 8951 71 29 as as IN 8951 71 30 the the DT 8951 71 31 average average NN 8951 71 32 of of IN 8951 71 33 ratios ratio NNS 8951 71 34 ( ( -LRB- 8951 71 35 i.e. i.e. FW 8951 71 36 , , , 8951 71 37 averaging average VBG 8951 71 38 the the DT 8951 71 39 Val'iety Val'iety NNP 8951 71 40 - - HYPH 8951 71 41 Genemtor Genemtor NNP 8951 71 42 App1'0ach App1'0ach NNP 8951 71 43 / / SYM 8951 71 44 FOKKER FOKKER NNP 8951 71 45 and and CC 8951 71 46 LYNCH LYNCH NNP 8951 71 47 207 207 CD 8951 71 48 figures figure NNS 8951 71 49 for for IN 8951 71 50 individual individual JJ 8951 71 51 searches search NNS 8951 71 52 ) ) -RRB- 8951 71 53 . . . 8951 72 1 The the DT 8951 72 2 latter latter JJ 8951 72 3 gives give VBZ 8951 72 4 higher high JJR 8951 72 5 figures figure NNS 8951 72 6 , , , 8951 72 7 since since IN 8951 72 8 many many JJ 8951 72 9 of of IN 8951 72 10 the the DT 8951 72 11 individual individual JJ 8951 72 12 searches search NNS 8951 72 13 give give VBP 8951 72 14 100 100 CD 8951 72 15 percent percent NN 8951 72 16 precision precision NN 8951 72 17 ratios ratio NNS 8951 72 18 . . . 8951 73 1 The the DT 8951 73 2 precision precision NN 8951 73 3 ratio ratio NN 8951 73 4 was be VBD 8951 73 5 found find VBN 8951 73 6 to to TO 8951 73 7 be be VB 8951 73 8 dependent dependent JJ 8951 73 9 on on IN 8951 73 10 file file NN 8951 73 11 size size NN 8951 73 12 and and CC 8951 73 13 to to TO 8951 73 14 fall fall VB 8951 73 15 somewhat somewhat RB 8951 73 16 as as IN 8951 73 17 the the DT 8951 73 18 size size NN 8951 73 19 of of IN 8951 73 20 file file NN 8951 73 21 increases increase NNS 8951 73 22 . . . 8951 74 1 This this DT 8951 74 2 is be VBZ 8951 74 3 due due JJ 8951 74 4 to to IN 8951 74 5 the the DT 8951 74 6 fact fact NN 8951 74 7 that that IN 8951 74 8 the the DT 8951 74 9 key- key- NN 8951 74 10 sets set NNS 8951 74 11 provided provide VBN 8951 74 12 only only RB 8951 74 13 a a DT 8951 74 14 limited limit VBN 8951 74 15 , , , 8951 74 16 if if IN 8951 74 17 very very RB 8951 74 18 high high JJ 8951 74 19 , , , 8951 74 20 total total JJ 8951 74 21 number number NN 8951 74 22 of of IN 8951 74 23 possible possible JJ 8951 74 24 combi- combi- JJ 8951 74 25 nations nation NNS 8951 74 26 , , , 8951 74 27 while while IN 8951 74 28 the the DT 8951 74 29 total total JJ 8951 74 30 possible possible JJ 8951 74 31 variety variety NN 8951 74 32 of of IN 8951 74 33 personal personal JJ 8951 74 34 names name NNS 8951 74 35 is be VBZ 8951 74 36 virtually virtually RB 8951 74 37 un- un- JJ 8951 74 38 limited limited JJ 8951 74 39 . . . 8951 75 1 The the DT 8951 75 2 evaluation evaluation NN 8951 75 3 was be VBD 8951 75 4 performed perform VBN 8951 75 5 with with IN 8951 75 6 a a DT 8951 75 7 sample sample NN 8951 75 8 of of IN 8951 75 9 700 700 CD 8951 75 10 names name NNS 8951 75 11 , , , 8951 75 12 selected select VBN 8951 75 13 by by IN 8951 75 14 interval interval NN 8951 75 15 sampling sampling NN 8951 75 16 . . . 8951 76 1 This this DT 8951 76 2 number number NN 8951 76 3 ensured ensure VBD 8951 76 4 a a DT 8951 76 5 99 99 CD 8951 76 6 percent percent NN 8951 76 7 confidence confidence NN 8951 76 8 limit limit NN 8951 76 9 in in IN 8951 76 10 the the DT 8951 76 11 results result NNS 8951 76 12 . . . 8951 77 1 A a DT 8951 77 2 comparison comparison NN 8951 77 3 of of IN 8951 77 4 the the DT 8951 77 5 interval interval NN 8951 77 6 sampled sample VBN 8951 77 7 query query NN 8951 77 8 names name NNS 8951 77 9 with with IN 8951 77 10 ran- ran- NNP 8951 77 11 domly domly RB 8951 77 12 sampled sample VBN 8951 77 13 names name NNS 8951 77 14 showed show VBD 8951 77 15 that that IN 8951 77 16 no no DT 8951 77 17 bias bias NN 8951 77 18 was be VBD 8951 77 19 introduced introduce VBN 8951 77 20 by by IN 8951 77 21 interval interval VBG 8951 77 22 sam- sam- JJ 8951 77 23 pling pling NN 8951 77 24 . . . 8951 78 1 A a DT 8951 78 2 test test NN 8951 78 3 to to TO 8951 78 4 confirm confirm VB 8951 78 5 that that IN 8951 78 6 the the DT 8951 78 7 retrieval retrieval NN 8951 78 8 effectiveness effectiveness NN 8951 78 9 reached reach VBD 8951 78 10 a a DT 8951 78 11 peak peak NN 8951 78 12 at at IN 8951 78 13 the the DT 8951 78 14 maximum maximum JJ 8951 78 15 value value NN 8951 78 16 of of IN 8951 78 17 the the DT 8951 78 18 relative relative JJ 8951 78 19 entropy entropy RB 8951 78 20 of of IN 8951 78 21 a a DT 8951 78 22 balanced balanced JJ 8951 78 23 key key NN 8951 78 24 - - HYPH 8951 78 25 set set NN 8951 78 26 was be VBD 8951 78 27 per- per- RB 8951 78 28 formed form VBN 8951 78 29 first first RB 8951 78 30 . . . 8951 79 1 This this DT 8951 79 2 was be VBD 8951 79 3 carried carry VBN 8951 79 4 out out RP 8951 79 5 on on IN 8951 79 6 a a DT 8951 79 7 file file NN 8951 79 8 of of IN 8951 79 9 25,000 25,000 CD 8951 79 10 names name NNS 8951 79 11 , , , 8951 79 12 using use VBG 8951 79 13 as as IN 8951 79 14 queries query NNS 8951 79 15 names name NNS 8951 79 16 selected select VBN 8951 79 17 from from IN 8951 79 18 the the DT 8951 79 19 file file NN 8951 79 20 and and CC 8951 79 21 the the DT 8951 79 22 optimal optimal JJ 8951 79 23 148-key 148-key CD 8951 79 24 key key NN 8951 79 25 - - HYPH 8951 79 26 set set NN 8951 79 27 . . . 8951 80 1 As as IN 8951 80 2 shown show VBN 8951 80 3 in in IN 8951 80 4 Table table NN 8951 80 5 1 1 CD 8951 80 6 , , , 8951 80 7 the the DT 8951 80 8 values value NNS 8951 80 9 of of IN 8951 80 10 the the DT 8951 80 11 precision precision NN 8951 80 12 ratio ratio NN 8951 80 13 ( ( -LRB- 8951 80 14 ratio ratio NN 8951 80 15 of of IN 8951 80 16 averages average NNS 8951 80 17 ) ) -RRB- 8951 80 18 and and CC 8951 80 19 of of IN 8951 80 20 the the DT 8951 80 21 relative relative JJ 8951 80 22 entropy entropy JJ 8951 80 23 both both DT 8951 80 24 peak peak NN 8951 80 25 at at IN 8951 80 26 the the DT 8951 80 27 same same JJ 8951 80 28 ratio ratio NN 8951 80 29 of of IN 8951 80 30 n n NN 8951 80 31 - - HYPH 8951 80 32 gram gram NN 8951 80 33 keys key NNS 8951 80 34 from from IN 8951 80 35 the the DT 8951 80 36 front front NN 8951 80 37 and and CC 8951 80 38 back back NN 8951 80 39 of of IN 8951 80 40 the the DT 8951 80 41 surnames surname NNS 8951 80 42 . . . 8951 81 1 The the DT 8951 81 2 performance performance NN 8951 81 3 of of IN 8951 81 4 the the DT 8951 81 5 optimal optimal JJ 8951 81 6 key key NN 8951 81 7 - - HYPH 8951 81 8 sets set NNS 8951 81 9 of of IN 8951 81 10 148 148 CD 8951 81 11 , , , 8951 81 12 254 254 CD 8951 81 13 , , , 8951 81 14 and and CC 8951 81 15 296 296 CD 8951 81 16 keys key NNS 8951 81 17 with with IN 8951 81 18 files file NNS 8951 81 19 of of IN 8951 81 20 10,000 10,000 CD 8951 81 21 , , , 8951 81 22 25,000 25,000 CD 8951 81 23 , , , 8951 81 24 and and CC 8951 81 25 50,000 50,000 CD 8951 81 26 names name NNS 8951 81 27 is be VBZ 8951 81 28 shown show VBN 8951 81 29 in in IN 8951 81 30 Table Table NNP 8951 81 31 7 7 CD 8951 81 32 . . . 8951 82 1 Calculated calculate VBN 8951 82 2 as as IN 8951 82 3 the the DT 8951 82 4 ratio ratio NN 8951 82 5 of of IN 8951 82 6 averages average NNS 8951 82 7 , , , 8951 82 8 the the DT 8951 82 9 smallest small JJS 8951 82 10 key key NN 8951 82 11 - - HYPH 8951 82 12 set set NN 8951 82 13 ( ( -LRB- 8951 82 14 148 148 CD 8951 82 15 keys key NNS 8951 82 16 ) ) -RRB- 8951 82 17 shows show VBZ 8951 82 18 a a DT 8951 82 19 precision precision NN 8951 82 20 ratio ratio NN 8951 82 21 of of IN 8951 82 22 64 64 CD 8951 82 23 percent percent NN 8951 82 24 with with IN 8951 82 25 a a DT 8951 82 26 file file NN 8951 82 27 of of IN 8951 82 28 50,000 50,000 CD 8951 82 29 names name NNS 8951 82 30 , , , 8951 82 31 which which WDT 8951 82 32 means mean VBZ 8951 82 33 that that DT 8951 82 34 of of IN 8951 82 35 every every DT 8951 82 36 three three CD 8951 82 37 names name NNS 8951 82 38 identified identify VBN 8951 82 39 in in IN 8951 82 40 the the DT 8951 82 41 variety variety NN 8951 82 42 - - HYPH 8951 82 43 generator generator NN 8951 82 44 search search NN 8951 82 45 , , , 8951 82 46 two two CD 8951 82 47 are be VBP 8951 82 48 those those DT 8951 82 49 de- de- RB 8951 82 50 sired sire VBD 8951 82 51 . . . 8951 83 1 With with IN 8951 83 2 the the DT 8951 83 3 largest large JJS 8951 83 4 key key NN 8951 83 5 - - HYPH 8951 83 6 set set NN 8951 83 7 ( ( -LRB- 8951 83 8 296 296 CD 8951 83 9 keys key NNS 8951 83 10 ) ) -RRB- 8951 83 11 , , , 8951 83 12 this this DT 8951 83 13 rises rise VBZ 8951 83 14 to to IN 8951 83 15 nine nine CD 8951 83 16 correctly correctly RB 8951 83 17 identi- identi- NN 8951 83 18 fied fie VBN 8951 83 19 names name NNS 8951 83 20 in in IN 8951 83 21 every every DT 8951 83 22 ten ten CD 8951 83 23 retrieved retrieve VBN 8951 83 24 at at IN 8951 83 25 this this DT 8951 83 26 stage stage NN 8951 83 27 . . . 8951 84 1 On on IN 8951 84 2 the the DT 8951 84 3 other other JJ 8951 84 4 hand hand NN 8951 84 5 , , , 8951 84 6 calculat- calculat- NNP 8951 84 7 ed ed NNP 8951 84 8 as as IN 8951 84 9 the the DT 8951 84 10 average average NN 8951 84 11 of of IN 8951 84 12 ratios ratio NNS 8951 84 13 , , , 8951 84 14 the the DT 8951 84 15 precision precision NN 8951 84 16 ratios ratio NNS 8951 84 17 rise rise VBP 8951 84 18 to to IN 8951 84 19 81 81 CD 8951 84 20 percent percent NN 8951 84 21 and and CC 8951 84 22 94 94 CD 8951 84 23 percent percent NN 8951 84 24 respectively respectively RB 8951 84 25 . . . 8951 85 1 For for IN 8951 85 2 smaller small JJR 8951 85 3 file file NN 8951 85 4 sizes size NNS 8951 85 5 - - : 8951 85 6 typical typical JJ 8951 85 7 , , , 8951 85 8 for for IN 8951 85 9 instance instance NN 8951 85 10 , , , 8951 85 11 of of IN 8951 85 12 cur- cur- DT 8951 85 13 rent rent NN 8951 85 14 - - HYPH 8951 85 15 awareness awareness NN 8951 85 16 searches search NNS 8951 85 17 - - : 8951 85 18 the the DT 8951 85 19 figures figure NNS 8951 85 20 for for IN 8951 85 21 all all DT 8951 85 22 of of IN 8951 85 23 these these DT 8951 85 24 are be VBP 8951 85 25 cotTespondingly cottespondingly RB 8951 85 26 higher high JJR 8951 85 27 . . . 8951 86 1 Table table NN 8951 86 2 7 7 CD 8951 86 3 . . . 8951 87 1 Precision Precision NNP 8951 87 2 Ratios Ratios NNPS 8951 87 3 Obtained obtain VBN 8951 87 4 in in IN 8951 87 5 Variety Variety NNP 8951 87 6 - - HYPH 8951 87 7 Generator Generator NNP 8951 87 8 Searches Searches NNPS 8951 87 9 of of IN 8951 87 10 Personal Personal NNP 8951 87 11 Names Names NNPS 8951 87 12 - - HYPH 8951 87 13 Queries Queries NNPS 8951 87 14 Sampled sample VBN 8951 87 15 from from IN 8951 87 16 Sea1'ch sea1'ch NN 8951 87 17 File file NN 8951 87 18 ( ( -LRB- 8951 87 19 Confidence confidence NN 8951 87 20 Level= level= NN 8951 87 21 99 99 CD 8951 87 22 Pm·cent pm·cent NN 8951 87 23 ) ) -RRB- 8951 87 24 Precision Precision NNP 8951 87 25 as as IN 8951 87 26 ratio ratio NN 8951 87 27 of of IN 8951 87 28 averages average NNS 8951 87 29 ( ( -LRB- 8951 87 30 % % NN 8951 87 31 ) ) -RRB- 8951 87 32 : : : 8951 87 33 File file NN 8951 87 34 Size size NN 8951 87 35 50,000 50,000 CD 8951 87 36 25,000 25,000 CD 8951 87 37 10,000 10,000 CD 8951 87 38 Precision Precision NNP 8951 87 39 as as IN 8951 87 40 average average JJ 8951 87 41 of of IN 8951 87 42 ratios ratio NNS 8951 87 43 ( ( -LRB- 8951 87 44 % % NN 8951 87 45 ) ) -RRB- 8951 87 46 : : : 8951 87 47 File file NN 8951 87 48 Size size NN 8951 87 49 50,000 50,000 CD 8951 87 50 25,000 25,000 CD 8951 87 51 10,000 10,000 CD 8951 87 52 148 148 CD 8951 87 53 64 64 CD 8951 87 54 71 71 CD 8951 87 55 84 84 CD 8951 87 56 148 148 CD 8951 87 57 81 81 CD 8951 87 58 87 87 CD 8951 87 59 93 93 CD 8951 87 60 Key Key NNP 8951 87 61 - - HYPH 8951 87 62 Set set NN 8951 87 63 Size Size NNP 8951 87 64 254 254 CD 8951 87 65 87 87 CD 8951 87 66 90 90 CD 8951 87 67 93 93 CD 8951 87 68 Key Key NNP 8951 87 69 - - HYPH 8951 87 70 Set set NN 8951 87 71 Size Size NNP 8951 87 72 254 254 CD 8951 87 73 91 91 CD 8951 87 74 95 95 CD 8951 87 75 97 97 CD 8951 87 76 296 296 CD 8951 87 77 90 90 CD 8951 87 78 91 91 CD 8951 87 79 94 94 CD 8951 87 80 296 296 CD 8951 87 81 94 94 CD 8951 87 82 96 96 CD 8951 87 83 97 97 CD 8951 87 84 ' ' CD 8951 87 85 ; ; : 8951 87 86 ~ ~ ADD 8951 87 87 ; ; : 8951 87 88 : : : 8951 87 89 208 208 CD 8951 87 90 Journal Journal NNP 8951 87 91 of of IN 8951 87 92 Library Library NNP 8951 87 93 Automation Automation NNP 8951 87 94 Vol Vol NNP 8951 87 95 . . . 8951 88 1 7/3 7/3 CD 8951 88 2 September September NNP 8951 88 3 1974 1974 CD 8951 88 4 The the DT 8951 88 5 effect effect NN 8951 88 6 of of IN 8951 88 7 sampling sample VBG 8951 88 8 from from IN 8951 88 9 a a DT 8951 88 10 larger large JJR 8951 88 11 file file NN 8951 88 12 , , , 8951 88 13 so so IN 8951 88 14 that that IN 8951 88 15 increasing increase VBG 8951 88 16 proportions proportion NNS 8951 88 17 of of IN 8951 88 18 the the DT 8951 88 19 names name NNS 8951 88 20 searched search VBD 8951 88 21 for for IN 8951 88 22 are be VBP 8951 88 23 not not RB 8951 88 24 present present JJ 8951 88 25 in in IN 8951 88 26 the the DT 8951 88 27 search search NN 8951 88 28 file file NN 8951 88 29 , , , 8951 88 30 is be VBZ 8951 88 31 shown show VBN 8951 88 32 in in IN 8951 88 33 Table table NN 8951 88 34 8 8 CD 8951 88 35 for for IN 8951 88 36 a a DT 8951 88 37 file file NN 8951 88 38 of of IN 8951 88 39 25,000 25,000 CD 8951 88 40 names name NNS 8951 88 41 . . . 8951 89 1 In in IN 8951 89 2 this this DT 8951 89 3 case case NN 8951 89 4 , , , 8951 89 5 the the DT 8951 89 6 proportion proportion NN 8951 89 7 of of IN 8951 89 8 correct- correct- NN 8951 89 9 ly ly XX 8951 89 10 identified identify VBN 8951 89 11 names name NNS 8951 89 12 in in IN 8951 89 13 the the DT 8951 89 14 total total JJ 8951 89 15 falls fall NNS 8951 89 16 , , , 8951 89 17 so so IN 8951 89 18 that that IN 8951 89 19 overall overall JJ 8951 89 20 performance performance NN 8951 89 21 is be VBZ 8951 89 22 some- some- NN 8951 89 23 what what WDT 8951 89 24 reduced reduce VBD 8951 89 25 . . . 8951 90 1 Thus thus RB 8951 90 2 , , , 8951 90 3 depending depend VBG 8951 90 4 both both DT 8951 90 5 on on IN 8951 90 6 file file NN 8951 90 7 size size NN 8951 90 8 and and CC 8951 90 9 on on IN 8951 90 10 the the DT 8951 90 11 expected expected JJ 8951 90 12 pro- pro- NN 8951 90 13 portion portion NN 8951 90 14 of of IN 8951 90 15 queries query NNS 8951 90 16 identifying identify VBG 8951 90 17 hits hit NNS 8951 90 18 , , , 8951 90 19 the the DT 8951 90 20 key key JJ 8951 90 21 - - HYPH 8951 90 22 set set NN 8951 90 23 size size NN 8951 90 24 can can MD 8951 90 25 be be VB 8951 90 26 adjusted adjust VBN 8951 90 27 to to TO 8951 90 28 reach reach VB 8951 90 29 a a DT 8951 90 30 desired desire VBN 8951 90 31 level level NN 8951 90 32 of of IN 8951 90 33 performance performance NN 8951 90 34 . . . 8951 91 1 In in IN 8951 91 2 addition addition NN 8951 91 3 , , , 8951 91 4 tests test NNS 8951 91 5 to to TO 8951 91 6 determine determine VB 8951 91 7 the the DT 8951 91 8 Table Table NNP 8951 91 9 B. b. NN 8951 92 1 Effect effect NN 8951 92 2 of of IN 8951 92 3 Varying Varying NNP 8951 92 4 Proportion proportion NN 8951 92 5 of of IN 8951 92 6 Query query NN 8951 92 7 Names name NNS 8951 92 8 Not not RB 8951 92 9 Present present JJ 8951 92 10 in in IN 8951 92 11 Search Search NNP 8951 92 12 File File NNP 8951 92 13 of of IN 8951 92 14 25,000 25,000 CD 8951 92 15 Names Names NNPS 8951 92 16 , , , 8951 92 17 Using use VBG 8951 92 18 296 296 CD 8951 92 19 Keys key NNS 8951 92 20 ( ( -LRB- 8951 92 21 Ratio ratio NN 8951 92 22 of of IN 8951 92 23 Averages Averages NNP 8951 92 24 ) ) -RRB- 8951 92 25 % % NN 8951 92 26 of of IN 8951 92 27 Names Names NNPS 8951 92 28 Not not RB 8951 92 29 Precision% precision% NN 8951 92 30 Number number NN 8951 92 31 of of IN 8951 92 32 Names Names NNP 8951 92 33 Number Number NNP 8951 92 34 of of IN 8951 92 35 Names Names NNPS 8951 92 36 in in IN 8951 92 37 Search Search NNP 8951 92 38 File File NNP 8951 92 39 ( ( -LRB- 8951 92 40 Ratio ratio NN 8951 92 41 of of IN 8951 92 42 Averages Averages NNP 8951 92 43 ) ) -RRB- 8951 92 44 Ret1·ieved Ret1·ieved NNP 8951 92 45 Correctly correctly RB 8951 92 46 Retrieved Retrieved NNP 8951 92 47 21 21 CD 8951 92 48 90 90 CD 8951 92 49 766 766 CD 8951 92 50 691 691 CD 8951 92 51 42 42 CD 8951 92 52 85 85 CD 8951 92 53 595 595 CD 8951 92 54 505 505 CD 8951 92 55 61 61 CD 8951 92 56 83 83 CD 8951 92 57 449 449 CD 8951 92 58 371 371 CD 8951 92 59 74 74 CD 8951 92 60 76 76 CD 8951 92 61 319 319 CD 8951 92 62 242 242 CD 8951 92 63 84 84 CD 8951 92 64 68 68 CD 8951 92 65 228 228 CD 8951 92 66 154 154 CD 8951 92 67 applicability applicability NN 8951 92 68 of of IN 8951 92 69 a a DT 8951 92 70 key key NN 8951 92 71 - - HYPH 8951 92 72 set set NN 8951 92 73 optimized optimize VBN 8951 92 74 for for IN 8951 92 75 one one CD 8951 92 76 file file NN 8951 92 77 of of IN 8951 92 78 50,000 50,000 CD 8951 92 79 names name NNS 8951 92 80 to to IN 8951 92 81 another another DT 8951 92 82 file file NN 8951 92 83 of of IN 8951 92 84 the the DT 8951 92 85 same same JJ 8951 92 86 provenance provenance NN 8951 92 87 and and CC 8951 92 88 size size NN 8951 92 89 were be VBD 8951 92 90 carried carry VBN 8951 92 91 out out RP 8951 92 92 . . . 8951 93 1 The the DT 8951 93 2 three three CD 8951 93 3 key key JJ 8951 93 4 - - HYPH 8951 93 5 sets set NNS 8951 93 6 derived derive VBN 8951 93 7 from from IN 8951 93 8 the the DT 8951 93 9 first first JJ 8951 93 10 file file NN 8951 93 11 were be VBD 8951 93 12 applied apply VBN 8951 93 13 to to IN 8951 93 14 the the DT 8951 93 15 second second JJ 8951 93 16 , , , 8951 93 17 query query NN 8951 93 18 names name NNS 8951 93 19 sam- sam- RB 8951 93 20 pled plead VBD 8951 93 21 from from IN 8951 93 22 the the DT 8951 93 23 latter latter JJ 8951 93 24 , , , 8951 93 25 and and CC 8951 93 26 the the DT 8951 93 27 precision precision NN 8951 93 28 ratios ratio NNS 8951 93 29 determined determine VBN 8951 93 30 . . . 8951 94 1 Some some DT 8951 94 2 reduction reduction NN 8951 94 3 in in IN 8951 94 4 performance performance NN 8951 94 5 was be VBD 8951 94 6 observed observe VBN 8951 94 7 ; ; : 8951 94 8 expressed express VBN 8951 94 9 as as IN 8951 94 10 ratio ratio NN 8951 94 11 of of IN 8951 94 12 averages average NNS 8951 94 13 , , , 8951 94 14 the the DT 8951 94 15 precision precision NN 8951 94 16 with with IN 8951 94 17 the the DT 8951 94 18 296-key 296-key CD 8951 94 19 key key NN 8951 94 20 - - HYPH 8951 94 21 set set NN 8951 94 22 fell fall VBD 8951 94 23 from from IN 8951 94 24 90 90 CD 8951 94 25 to to TO 8951 94 26 83 83 CD 8951 94 27 percent percent NN 8951 94 28 , , , 8951 94 29 with with IN 8951 94 30 the the DT 8951 94 31 254-key 254-key CD 8951 94 32 key- key- NNS 8951 94 33 set set VBN 8951 94 34 from from IN 8951 94 35 87 87 CD 8951 94 36 to to TO 8951 94 37 82 82 CD 8951 94 38 percent percent NN 8951 94 39 , , , 8951 94 40 and and CC 8951 94 41 with with IN 8951 94 42 the the DT 8951 94 43 148-key 148-key CD 8951 94 44 key key NN 8951 94 45 - - HYPH 8951 94 46 set set NN 8951 94 47 from from IN 8951 94 48 64 64 CD 8951 94 49 to to TO 8951 94 50 56 56 CD 8951 94 51 per- per- NN 8951 94 52 cent cent NN 8951 94 53 , , , 8951 94 54 figures figure NNS 8951 94 55 which which WDT 8951 94 56 seem seem VBP 8951 94 57 unlikely unlikely JJ 8951 94 58 to to TO 8951 94 59 prejudice prejudice VB 8951 94 60 the the DT 8951 94 61 net net JJ 8951 94 62 performance performance NN 8951 94 63 in in IN 8951 94 64 any any DT 8951 94 65 marked marked JJ 8951 94 66 way way NN 8951 94 67 . . . 8951 95 1 Nonetheless nonetheless RB 8951 95 2 , , , 8951 95 3 monitoring monitoring NN 8951 95 4 of of IN 8951 95 5 performance performance NN 8951 95 6 and and CC 8951 95 7 of of IN 8951 95 8 data data NNP 8951 95 9 base base NNP 8951 95 10 name name NN 8951 95 11 characteristics characteristic NNS 8951 95 12 over over IN 8951 95 13 a a DT 8951 95 14 period period NN 8951 95 15 of of IN 8951 95 16 operation operation NN 8951 95 17 might may MD 8951 95 18 well well RB 8951 95 19 be be VB 8951 95 20 advisable advisable JJ 8951 95 21 . . . 8951 96 1 DISTRIBUTION distribution NN 8951 96 2 CHARACTERISTICS characteristic NNS 8951 96 3 OF of IN 8951 96 4 OTHER other JJ 8951 96 5 TYPES types NN 8951 96 6 OF of IN 8951 96 7 KEYS KEYS NNP 8951 96 8 It -PRON- PRP 8951 96 9 is be VBZ 8951 96 10 particularly particularly RB 8951 96 11 instructive instructive JJ 8951 96 12 to to TO 8951 96 13 examine examine VB 8951 96 14 the the DT 8951 96 15 distribution distribution NN 8951 96 16 characteristics characteristic NNS 8951 96 17 of of IN 8951 96 18 other other JJ 8951 96 19 types type NNS 8951 96 20 of of IN 8951 96 21 keys key NNS 8951 96 22 , , , 8951 96 23 including include VBG 8951 96 24 those those DT 8951 96 25 of of IN 8951 96 26 fixed fixed JJ 8951 96 27 length length NN 8951 96 28 , , , 8951 96 29 generated generate VBN 8951 96 30 from from IN 8951 96 31 various various JJ 8951 96 32 positions position NNS 8951 96 33 in in IN 8951 96 34 the the DT 8951 96 35 names name NNS 8951 96 36 , , , 8951 96 37 and and CC 8951 96 38 to to TO 8951 96 39 compare compare VB 8951 96 40 them -PRON- PRP 8951 96 41 with with IN 8951 96 42 those those DT 8951 96 43 of of IN 8951 96 44 the the DT 8951 96 45 opti- opti- NNP 8951 96 46 mal mal NNP 8951 96 47 key key NN 8951 96 48 - - HYPH 8951 96 49 sets set NNS 8951 96 50 employed employ VBN 8951 96 51 in in IN 8951 96 52 the the DT 8951 96 53 variety variety NN 8951 96 54 - - HYPH 8951 96 55 generator generator NN 8951 96 56 approach approach NN 8951 96 57 . . . 8951 97 1 To to IN 8951 97 2 this this DT 8951 97 3 end end NN 8951 97 4 , , , 8951 97 5 the the DT 8951 97 6 file file NN 8951 97 7 of of IN 8951 97 8 50,000 50,000 CD 8951 97 9 names name NNS 8951 97 10 was be VBD 8951 97 11 processed process VBN 8951 97 12 to to TO 8951 97 13 produce produce VB 8951 97 14 the the DT 8951 97 15 following follow VBG 8951 97 16 keys key NNS 8951 97 17 or or CC 8951 97 18 key- key- NN 8951 97 19 sets set NNS 8951 97 20 : : : 8951 97 21 1 1 CD 8951 97 22 . . . 8951 98 1 Initial initial JJ 8951 98 2 digram digram NN 8951 98 3 of of IN 8951 98 4 surname surname NN 8951 98 5 . . . 8951 99 1 2 2 LS 8951 99 2 . . . 8951 100 1 Initial initial JJ 8951 100 2 trigram trigram NN 8951 100 3 of of IN 8951 100 4 surname surname NN 8951 100 5 . . . 8951 101 1 3 3 LS 8951 101 2 . . . 8951 102 1 Key key NN 8951 102 2 - - HYPH 8951 102 3 set set NN 8951 102 4 of of IN 8951 102 5 ninety ninety CD 8951 102 6 - - HYPH 8951 102 7 four four CD 8951 102 8 n n CD 8951 102 9 - - HYPH 8951 102 10 grams gram NNS 8951 102 11 from from IN 8951 102 12 the the DT 8951 102 13 front front NN 8951 102 14 of of IN 8951 102 15 the the DT 8951 102 16 surname surname NN 8951 102 17 , , , 8951 102 18 with with IN 8951 102 19 first first JJ 8951 102 20 and and CC 8951 102 21 second second JJ 8951 102 22 initials initial NNS 8951 102 23 . . . 8951 103 1 4 4 LS 8951 103 2 . . . 8951 104 1 Key key JJ 8951 104 2 - - HYPH 8951 104 3 set set NN 8951 104 4 consisting consisting NN 8951 104 5 of of IN 8951 104 6 first first JJ 8951 104 7 and and CC 8951 104 8 last last JJ 8951 104 9 character character NN 8951 104 10 of of IN 8951 104 11 surname surname NN 8951 104 12 , , , 8951 104 13 with with IN 8951 104 14 first first JJ 8951 104 15 and and CC 8951 104 16 second second JJ 8951 104 17 initials initial NNS 8951 104 18 . . . 8951 105 1 The the DT 8951 105 2 figures figure NNS 8951 105 3 ( ( -LRB- 8951 105 4 Table table NN 8951 105 5 9 9 CD 8951 105 6 ) ) -RRB- 8951 105 7 show show VBP 8951 105 8 clearly clearly RB 8951 105 9 that that IN 8951 105 10 all all DT 8951 105 11 have have VBP 8951 105 12 distributions distribution NNS 8951 105 13 which which WDT 8951 105 14 leave leave VBP 8951 105 15 no no DT 8951 105 16 doubt doubt NN 8951 105 17 as as IN 8951 105 18 to to IN 8951 105 19 their -PRON- PRP$ 8951 105 20 relative relative JJ 8951 105 21 inadequacy inadequacy NN 8951 105 22 in in IN 8951 105 23 resolving resolve VBG 8951 105 24 power power NN 8951 105 25 , , , 8951 105 26 where where WRB 8951 105 27 this this DT 8951 105 28 is be VBZ 8951 105 29 defined define VBN 8951 105 30 as as IN 8951 105 31 the the DT 8951 105 32 ratio ratio NN 8951 105 33 of of IN 8951 105 34 distinct distinct JJ 8951 105 35 name name NN 8951 105 36 representations representation NNS 8951 105 37 provided provide VBN 8951 105 38 by by IN 8951 105 39 the the DT 8951 105 40 key key NN 8951 105 41 - - HYPH 8951 105 42 set set NN 8951 105 43 used use VBN 8951 105 44 to to IN 8951 105 45 the the DT 8951 105 46 number number NN 8951 105 47 of of IN 8951 105 48 different different JJ 8951 105 49 name name NN 8951 105 50 entries entry NNS 8951 105 51 ( ( -LRB- 8951 105 52 41,469 41,469 CD 8951 105 53 ) ) -RRB- 8951 105 54 in in IN 8951 105 55 the the DT 8951 105 56 file file NN 8951 105 57 . . . 8951 106 1 At at IN 8951 106 2 the the DT 8951 106 3 digram digram NN 8951 106 4 level level NN 8951 106 5 , , , 8951 106 6 the the DT 8951 106 7 value value NN 8951 106 8 of of IN 8951 106 9 the the DT 8951 106 10 resolving resolve VBG 8951 106 11 power power NN 8951 106 12 is be VBZ 8951 106 13 0.009 0.009 CD 8951 106 14 , , , 8951 106 15 i.e. i.e. FW 8951 106 16 , , , 8951 106 17 each each DT 8951 106 18 Vm·iety Vm·iety NNP 8951 106 19 - - HYPH 8951 106 20 Generator Generator NNP 8951 106 21 Approach Approach NNP 8951 106 22 / / SYM 8951 106 23 FOKKER FOKKER NNP 8951 106 24 and and CC 8951 106 25 LYNCH LYNCH NNP 8951 106 26 209 209 CD 8951 106 27 digram digram NN 8951 106 28 represents represent VBZ 8951 106 29 , , , 8951 106 30 on on IN 8951 106 31 average average JJ 8951 106 32 , , , 8951 106 33 110 110 CD 8951 106 34 different different JJ 8951 106 35 name name NN 8951 106 36 entries entry NNS 8951 106 37 , , , 8951 106 38 while while IN 8951 106 39 no no RB 8951 106 40 fewer few JJR 8951 106 41 than than IN 8951 106 42 thirty thirty CD 8951 106 43 - - HYPH 8951 106 44 two two CD 8951 106 45 specific specific JJ 8951 106 46 digrams digram NNS 8951 106 47 each each DT 8951 106 48 represent represent NN 8951 106 49 between between IN 8951 106 50 500 500 CD 8951 106 51 and and CC 8951 106 52 1,000 1,000 CD 8951 106 53 dif- dif- JJ 8951 106 54 ferent ferent JJ 8951 106 55 names name NNS 8951 106 56 . . . 8951 107 1 At at IN 8951 107 2 the the DT 8951 107 3 trigram trigram NN 8951 107 4 level level NN 8951 107 5 , , , 8951 107 6 the the DT 8951 107 7 value value NN 8951 107 8 of of IN 8951 107 9 the the DT 8951 107 10 resolving resolve VBG 8951 107 11 power power NN 8951 107 12 rises rise VBZ 8951 107 13 to to IN 8951 107 14 0.08 0.08 CD 8951 107 15 , , , 8951 107 16 a a DT 8951 107 17 tenfold tenfold JJ 8951 107 18 increase increase NN 8951 107 19 ; ; : 8951 107 20 however however RB 8951 107 21 , , , 8951 107 22 one one CD 8951 107 23 trigram trigram NN 8951 107 24 still still RB 8951 107 25 represents represent VBZ 8951 107 26 between between IN 8951 107 27 500 500 CD 8951 107 28 and and CC 8951 107 29 1,000 1,000 CD 8951 107 30 different different JJ 8951 107 31 names name NNS 8951 107 32 . . . 8951 108 1 Use use NN 8951 108 2 of of IN 8951 108 3 the the DT 8951 108 4 first first JJ 8951 108 5 and and CC 8951 108 6 last last JJ 8951 108 7 letters letter NNS 8951 108 8 of of IN 8951 108 9 the the DT 8951 108 10 surname surname NN 8951 108 11 plus plus CC 8951 108 12 the the DT 8951 108 13 initials initial NNS 8951 108 14 again again RB 8951 108 15 in- in- PRP 8951 108 16 creases crease VBZ 8951 108 17 the the DT 8951 108 18 value value NN 8951 108 19 of of IN 8951 108 20 the the DT 8951 108 21 resolving resolve VBG 8951 108 22 power power NN 8951 108 23 to to IN 8951 108 24 0.627 0.627 CD 8951 108 25 , , , 8951 108 26 or or CC 8951 108 27 1.6 1.6 CD 8951 108 28 distinct distinct JJ 8951 108 29 names name NNS 8951 108 30 per per IN 8951 108 31 entry entry NN 8951 108 32 ; ; : 8951 108 33 eight eight CD 8951 108 34 of of IN 8951 108 35 the the DT 8951 108 36 representations representation NNS 8951 108 37 now now RB 8951 108 38 account account VBP 8951 108 39 for for IN 8951 108 40 between between IN 8951 108 41 thirty thirty CD 8951 108 42 - - HYPH 8951 108 43 one one CD 8951 108 44 Table table NN 8951 108 45 9 9 CD 8951 108 46 . . . 8951 109 1 Distributions distribution NNS 8951 109 2 of of IN 8951 109 3 a a DT 8951 109 4 Variety Variety NNP 8951 109 5 of of IN 8951 109 6 Other other JJ 8951 109 7 Representations Representations NNPS 8951 109 8 of of IN 8951 109 9 Personal Personal NNP 8951 109 10 Names Names NNPS 8951 109 11 in in IN 8951 109 12 a a DT 8951 109 13 File file NN 8951 109 14 of of IN 8951 109 15 50,000 50,000 CD 8951 109 16 Entries entry NNS 8951 109 17 94 94 CD 8951 109 18 n n CD 8951 109 19 - - HYPH 8951 109 20 grams gram NNS 8951 109 21 from from IN 8951 109 22 First first JJ 8951 109 23 and and CC 8951 109 24 Last last JJ 8951 109 25 Frequency Frequency NNP 8951 109 26 Initial Initial NNP 8951 109 27 Digram Digram NNP 8951 109 28 Initial Initial NNP 8951 109 29 Trigram Trigram NNP 8951 109 30 Front Front NNP 8951 109 31 of of IN 8951 109 32 Surname Surname NNP 8951 109 33 Letter Letter NNP 8951 109 34 of of IN 8951 109 35 Surname Surname NNP 8951 109 36 f f NNP 8951 109 37 of of IN 8951 109 38 Surname Surname NNP 8951 109 39 of of IN 8951 109 40 Surname Surname NNP 8951 109 41 Plus Plus NNP 8951 109 42 2 2 CD 8951 109 43 Initials Initials NNPS 8951 109 44 Plus plus CC 8951 109 45 2 2 CD 8951 109 46 Initials Initials NNPS 8951 109 47 1 1 CD 8951 109 48 40 40 CD 8951 109 49 735 735 CD 8951 109 50 8,964 8,964 CD 8951 109 51 16,346 16,346 CD 8951 109 52 2 2 CD 8951 109 53 22 22 CD 8951 109 54 428 428 CD 8951 109 55 3,929 3,929 CD 8951 109 56 4,919 4,919 CD 8951 109 57 3 3 CD 8951 109 58 16 16 CD 8951 109 59 249 249 CD 8951 109 60 1,884 1,884 CD 8951 109 61 2,025 2,025 CD 8951 109 62 4 4 CD 8951 109 63 11 11 CD 8951 109 64 197 197 CD 8951 109 65 1,006 1,006 CD 8951 109 66 973 973 CD 8951 109 67 5 5 CD 8951 109 68 7 7 CD 8951 109 69 170 170 CD 8951 109 70 646 646 CD 8951 109 71 581 581 CD 8951 109 72 6 6 CD 8951 109 73 7 7 CD 8951 109 74 110 110 CD 8951 109 75 397 397 CD 8951 109 76 340 340 CD 8951 109 77 7 7 CD 8951 109 78 10 10 CD 8951 109 79 112 112 CD 8951 109 80 234 234 CD 8951 109 81 224 224 CD 8951 109 82 8 8 CD 8951 109 83 4 4 CD 8951 109 84 98 98 CD 8951 109 85 186 186 CD 8951 109 86 146 146 CD 8951 109 87 9 9 CD 8951 109 88 7 7 CD 8951 109 89 81 81 CD 8951 109 90 144 144 CD 8951 109 91 92 92 CD 8951 109 92 10 10 CD 8951 109 93 5 5 CD 8951 109 94 66 66 CD 8951 109 95 108 108 CD 8951 109 96 72 72 CD 8951 109 97 11 11 CD 8951 109 98 6 6 CD 8951 109 99 61 61 CD 8951 109 100 70 70 CD 8951 109 101 49 49 CD 8951 109 102 12 12 CD 8951 109 103 2 2 CD 8951 109 104 56 56 CD 8951 109 105 88 88 CD 8951 109 106 36 36 CD 8951 109 107 13 13 CD 8951 109 108 5 5 CD 8951 109 109 51 51 CD 8951 109 110 74 74 CD 8951 109 111 33 33 CD 8951 109 112 14 14 CD 8951 109 113 1 1 CD 8951 109 114 48 48 CD 8951 109 115 50 50 CD 8951 109 116 24 24 CD 8951 109 117 15 15 CD 8951 109 118 2 2 CD 8951 109 119 35 35 CD 8951 109 120 51 51 CD 8951 109 121 23 23 CD 8951 109 122 16 16 CD 8951 109 123 3 3 CD 8951 109 124 37 37 CD 8951 109 125 36 36 CD 8951 109 126 25 25 CD 8951 109 127 17 17 CD 8951 109 128 2 2 CD 8951 109 129 35 35 CD 8951 109 130 29 29 CD 8951 109 131 15 15 CD 8951 109 132 18 18 CD 8951 109 133 3 3 CD 8951 109 134 33 33 CD 8951 109 135 29 29 CD 8951 109 136 11 11 CD 8951 109 137 19 19 CD 8951 109 138 8 8 CD 8951 109 139 35 35 CD 8951 109 140 28 28 CD 8951 109 141 6 6 CD 8951 109 142 20 20 CD 8951 109 143 8 8 CD 8951 109 144 40 40 CD 8951 109 145 23 23 CD 8951 109 146 5 5 CD 8951 109 147 21 21 CD 8951 109 148 - - SYM 8951 109 149 30 30 CD 8951 109 150 21 21 CD 8951 109 151 207 207 CD 8951 109 152 127 127 CD 8951 109 153 49 49 CD 8951 109 154 31 31 CD 8951 109 155 - - SYM 8951 109 156 40 40 CD 8951 109 157 23 23 CD 8951 109 158 109 109 CD 8951 109 159 47 47 CD 8951 109 160 8 8 CD 8951 109 161 41 41 CD 8951 109 162 - - SYM 8951 109 163 50 50 CD 8951 109 164 13 13 CD 8951 109 165 88 88 CD 8951 109 166 13 13 CD 8951 109 167 51 51 CD 8951 109 168 - - SYM 8951 109 169 100 100 CD 8951 109 170 36 36 CD 8951 109 171 142 142 CD 8951 109 172 3 3 CD 8951 109 173 101 101 CD 8951 109 174 - - SYM 8951 109 175 200 200 CD 8951 109 176 24 24 CD 8951 109 177 62 62 CD 8951 109 178 201 201 CD 8951 109 179 - - SYM 8951 109 180 500 500 CD 8951 109 181 57 57 CD 8951 109 182 15 15 CD 8951 109 183 501 501 CD 8951 109 184 - - SYM 8951 109 185 1000 1000 CD 8951 109 186 32 32 CD 8951 109 187 1 1 CD 8951 109 188 Total total JJ 8951 109 189 375 375 CD 8951 109 190 3,301 3,301 CD 8951 109 191 18,166 18,166 CD 8951 109 192 26,002 26,002 CD 8951 109 193 Resolving resolve VBG 8951 109 194 power power NN 8951 109 195 .009 .009 CD 8951 109 196 .080 .080 CD 8951 109 197 .438 .438 CD 8951 109 198 .627 .627 CD 8951 109 199 and and CC 8951 109 200 forty forty CD 8951 109 201 distinct distinct JJ 8951 109 202 entries entry NNS 8951 109 203 . . . 8951 110 1 In in IN 8951 110 2 contrast contrast NN 8951 110 3 , , , 8951 110 4 however however RB 8951 110 5 , , , 8951 110 6 the the DT 8951 110 7 key key NN 8951 110 8 - - HYPH 8951 110 9 set set NN 8951 110 10 of of IN 8951 110 11 148 148 CD 8951 110 12 keys key NNS 8951 110 13 comprising comprise VBG 8951 110 14 ninety ninety CD 8951 110 15 - - HYPH 8951 110 16 four four CD 8951 110 17 n n NN 8951 110 18 - - HYPH 8951 110 19 gram gram NN 8951 110 20 keys key NNS 8951 110 21 from from IN 8951 110 22 the the DT 8951 110 23 front front NN 8951 110 24 of of IN 8951 110 25 the the DT 8951 110 26 name name NN 8951 110 27 and and CC 8951 110 28 the the DT 8951 110 29 first first JJ 8951 110 30 and and CC 8951 110 31 second second JJ 8951 110 32 initials initial NNS 8951 110 33 , , , 8951 110 34 although although IN 8951 110 35 almost almost RB 8951 110 36 50 50 CD 8951 110 37 percent percent NN 8951 110 38 larger large JJR 8951 110 39 than than IN 8951 110 40 the the DT 8951 110 41 four- four- NNP 8951 110 42 character character NN 8951 110 43 representation representation NNP 8951 110 44 , , , 8951 110 45 has have VBZ 8951 110 46 a a DT 8951 110 47 resolving resolve VBG 8951 110 48 power power NN 8951 110 49 of of IN 8951 110 50 only only RB 8951 110 51 0.438 0.438 CD 8951 110 52 ( ( -LRB- 8951 110 53 or or CC 8951 110 54 2.28 2.28 CD 8951 110 55 en- en- NN 8951 110 56 tries try NNS 8951 110 57 per per IN 8951 110 58 representation representation NN 8951 110 59 ) ) -RRB- 8951 110 60 . . . 8951 111 1 This this DT 8951 111 2 contrast contrast NN 8951 111 3 provides provide VBZ 8951 111 4 particularly particularly RB 8951 111 5 strong strong JJ 8951 111 6 evi- evi- NNS 8951 111 7 dence dence NN 8951 111 8 for for IN 8951 111 9 the the DT 8951 111 10 superiority superiority NN 8951 111 11 of of IN 8951 111 12 keys key NNS 8951 111 13 from from IN 8951 111 14 the the DT 8951 111 15 front front NN 8951 111 16 and and CC 8951 111 17 rear rear NN 8951 111 18 of of IN 8951 111 19 the the DT 8951 111 20 surnames surname NNS 8951 111 21 over over IN 8951 111 22 those those DT 8951 111 23 from from IN 8951 111 24 the the DT 8951 111 25 front front NN 8951 111 26 alone alone RB 8951 111 27 , , , 8951 111 28 even even RB 8951 111 29 when when WRB 8951 111 30 the the DT 8951 111 31 latter latter JJ 8951 111 32 are be VBP 8951 111 33 variable variable JJ 8951 111 34 in in IN 8951 111 35 • • NNP 8951 111 36 ' ' POS 8951 111 37 210 210 CD 8951 111 38 Journal Journal NNP 8951 111 39 of of IN 8951 111 40 Library Library NNP 8951 111 41 Automation Automation NNP 8951 111 42 Vol Vol NNP 8951 111 43 . . . 8951 112 1 7/3 7/3 CD 8951 112 2 September September NNP 8951 112 3 1974 1974 CD 8951 112 4 length length NN 8951 112 5 . . . 8951 113 1 As as IN 8951 113 2 expected expect VBN 8951 113 3 , , , 8951 113 4 the the DT 8951 113 5 precision precision NN 8951 113 6 ratio ratio NN 8951 113 7 of of IN 8951 113 8 the the DT 8951 113 9 four four CD 8951 113 10 - - HYPH 8951 113 11 character character NN 8951 113 12 representa- representa- NN 8951 113 13 tion tion NN 8951 113 14 is be VBZ 8951 113 15 low low JJ 8951 113 16 , , , 8951 113 17 at at IN 8951 113 18 37 37 CD 8951 113 19 percent percent NN 8951 113 20 ( ( -LRB- 8951 113 21 ratio ratio NN 8951 113 22 of of IN 8951 113 23 averages average NNS 8951 113 24 ) ) -RRB- 8951 113 25 , , , 8951 113 26 compared compare VBN 8951 113 27 with with IN 8951 113 28 64 64 CD 8951 113 29 percent percent NN 8951 113 30 for for IN 8951 113 31 the the DT 8951 113 32 optimal148-key optimal148-key NNP 8951 113 33 key key NN 8951 113 34 - - HYPH 8951 113 35 set set NN 8951 113 36 . . . 8951 114 1 EXTENT extent NN 8951 114 2 OF of IN 8951 114 3 STATISTICAL STATISTICAL NNP 8951 114 4 ASSOCIATION ASSOCIATION NNP 8951 114 5 AMONG among IN 8951 114 6 KEYS KEYS NNP 8951 114 7 Thus thus RB 8951 114 8 far far RB 8951 114 9 , , , 8951 114 10 the the DT 8951 114 11 frequency frequency NN 8951 114 12 of of IN 8951 114 13 occurrence occurrence NN 8951 114 14 of of IN 8951 114 15 variable variable JJ 8951 114 16 - - HYPH 8951 114 17 length length NN 8951 114 18 character character NN 8951 114 19 strings string NNS 8951 114 20 from from IN 8951 114 21 the the DT 8951 114 22 front front NN 8951 114 23 and and CC 8951 114 24 back back NN 8951 114 25 of of IN 8951 114 26 the the DT 8951 114 27 surnames surname NNS 8951 114 28 is be VBZ 8951 114 29 the the DT 8951 114 30 only only JJ 8951 114 31 factor factor NN 8951 114 32 consid- consid- NN 8951 114 33 ered ered JJ 8951 114 34 in in IN 8951 114 35 their -PRON- PRP$ 8951 114 36 selection selection NN 8951 114 37 as as IN 8951 114 38 keys key NNS 8951 114 39 . . . 8951 115 1 It -PRON- PRP 8951 115 2 is be VBZ 8951 115 3 well well RB 8951 115 4 known know VBN 8951 115 5 in in IN 8951 115 6 other other JJ 8951 115 7 areas area NNS 8951 115 8 that that WDT 8951 115 9 statisti- statisti- VBP 8951 115 10 cal cal NN 8951 115 11 associations association NNS 8951 115 12 among among IN 8951 115 13 keys key NNS 8951 115 14 can can MD 8951 115 15 influence influence VB 8951 115 16 the the DT 8951 115 17 effectiveness effectiveness NN 8951 115 18 of of IN 8951 115 19 their -PRON- PRP$ 8951 115 20 combi- combi- JJ 8951 115 21 nations nation NNS 8951 115 22 . . . 8951 116 1 3 3 LS 8951 116 2 Where where WRB 8951 116 3 a a DT 8951 116 4 strong strong JJ 8951 116 5 positive positive JJ 8951 116 6 association association NN 8951 116 7 between between IN 8951 116 8 two two CD 8951 116 9 keys key NNS 8951 116 10 exists exist VBZ 8951 116 11 , , , 8951 116 12 their -PRON- PRP$ 8951 116 13 intersection intersection NN 8951 116 14 results result NNS 8951 116 15 in in IN 8951 116 16 only only RB 8951 116 17 a a DT 8951 116 18 small small JJ 8951 116 19 reduction reduction NN 8951 116 20 of of IN 8951 116 21 the the DT 8951 116 22 number number NN 8951 116 23 of of IN 8951 116 24 items item NNS 8951 116 25 re- re- RB 8951 116 26 trieved trieve VBD 8951 116 27 over over IN 8951 116 28 that that DT 8951 116 29 obtained obtain VBN 8951 116 30 by by IN 8951 116 31 using use VBG 8951 116 32 each each DT 8951 116 33 independently independently RB 8951 116 34 . . . 8951 117 1 When when WRB 8951 117 2 the the DT 8951 117 3 associa- associa- NN 8951 117 4 tion tion NN 8951 117 5 is be VBZ 8951 117 6 strongly strongly RB 8951 117 7 negative negative JJ 8951 117 8 , , , 8951 117 9 the the DT 8951 117 10 result result NN 8951 117 11 of of IN 8951 117 12 intersection intersection NN 8951 117 13 may may MD 8951 117 14 be be VB 8951 117 15 much much RB 8951 117 16 greater great JJR 8951 117 17 than than IN 8951 117 18 that that DT 8951 117 19 predicted predict VBD 8951 117 20 on on IN 8951 117 21 the the DT 8951 117 22 basis basis NN 8951 117 23 of of IN 8951 117 24 the the DT 8951 117 25 product product NN 8951 117 26 of of IN 8951 117 27 the the DT 8951 117 28 individual individual JJ 8951 117 29 proba- proba- NN 8951 117 30 bilities bilitie NNS 8951 117 31 of of IN 8951 117 32 the the DT 8951 117 33 keys key NNS 8951 117 34 . . . 8951 118 1 To to TO 8951 118 2 assess assess VB 8951 118 3 the the DT 8951 118 4 extent extent NN 8951 118 5 of of IN 8951 118 6 associations association NNS 8951 118 7 among among IN 8951 118 8 keys key NNS 8951 118 9 from from IN 8951 118 10 the the DT 8951 118 11 front front NN 8951 118 12 and and CC 8951 118 13 rear rear NN 8951 118 14 of of IN 8951 118 15 surnames surname NNS 8951 118 16 and and CC 8951 118 17 initials initial NNS 8951 118 18 , , , 8951 118 19 sets set NNS 8951 118 20 of of IN 8951 118 21 both both DT 8951 118 22 fixed- fixed- JJ 8951 118 23 and and CC 8951 118 24 variable variable JJ 8951 118 25 - - HYPH 8951 118 26 length length NN 8951 118 27 keys key NNS 8951 118 28 from from IN 8951 118 29 each each DT 8951 118 30 of of IN 8951 118 31 these these DT 8951 118 32 positions position NNS 8951 118 33 were be VBD 8951 118 34 examined examine VBN 8951 118 35 . . . 8951 118 36 · · NFP 8951 118 37 The the DT 8951 118 38 Kendall Kendall NNP 8951 118 39 correlation correlation NN 8951 118 40 coefficient coefficient NN 8951 118 41 V v NN 8951 118 42 was be VBD 8951 118 43 calculated calculate VBN 8951 118 44 for for IN 8951 118 45 each each DT 8951 118 46 of of IN 8951 118 47 the the DT 8951 118 48 twenty twenty CD 8951 118 49 most most RBS 8951 118 50 frequent frequent JJ 8951 118 51 combinations combination NNS 8951 118 52 of of IN 8951 118 53 these these DT 8951 118 54 . . . 8951 119 1 This this DT 8951 119 2 is be VBZ 8951 119 3 related related JJ 8951 119 4 to to IN 8951 119 5 the the DT 8951 119 6 chi chi JJ 8951 119 7 - - HYPH 8951 119 8 square square JJ 8951 119 9 value value NN 8951 119 10 by by IN 8951 119 11 the the DT 8951 119 12 expression expression NN 8951 119 13 X2 X2 NNP 8951 119 14 = = SYM 8951 119 15 m m NNP 8951 119 16 V2 v2 NN 8951 119 17 where where WRB 8951 119 18 m m NNP 8951 119 19 is be VBZ 8951 119 20 the the DT 8951 119 21 file file NN 8951 119 22 size size NN 8951 119 23 , , , 8951 119 24 or or CC 8951 119 25 50,000 50,000 CD 8951 119 26 . . . 8951 120 1 Table table NN 8951 120 2 10 10 CD 8951 120 3 shows show VBZ 8951 120 4 the the DT 8951 120 5 values value NNS 8951 120 6 of of IN 8951 120 7 the the DT 8951 120 8 associa- associa- NNP 8951 120 9 tion tion NN 8951 120 10 coefficient coefficient NN 8951 120 11 for for IN 8951 120 12 certain certain JJ 8951 120 13 of of IN 8951 120 14 the the DT 8951 120 15 characters character NNS 8951 120 16 in in IN 8951 120 17 the the DT 8951 120 18 full full JJ 8951 120 19 name name NN 8951 120 20 . . . 8951 121 1 Those those DT 8951 121 2 above above IN 8951 121 3 .012 .012 CD 8951 121 4 are be VBP 8951 121 5 significant significant JJ 8951 121 6 at at IN 8951 121 7 a a DT 8951 121 8 99 99 CD 8951 121 9 percent percent NN 8951 121 10 confidence confidence NN 8951 121 11 level level NN 8951 121 12 . . . 8951 122 1 Positive positive JJ 8951 122 2 associations association NNS 8951 122 3 are be VBP 8951 122 4 Table table NN 8951 122 5 10 10 CD 8951 122 6 . . . 8951 123 1 A8sociation a8sociation CD 8951 123 2 Coefficients coefficient NNS 8951 123 3 for for IN 8951 123 4 Sets set NNS 8951 123 5 of of IN 8951 123 6 the the DT 8951 123 7 Most most RBS 8951 123 8 Frequent frequent JJ 8951 123 9 Digrams Digrams NNP 8951 123 10 from from IN 8951 123 11 Various various JJ 8951 123 12 Posi- Posi- NNP 8951 123 13 tions tion NNS 8951 123 14 in in IN 8951 123 15 Personal Personal NNP 8951 123 16 Names Names NNPS 8951 123 17 First first JJ 8951 123 18 and and CC 8951 123 19 Last last JJ 8951 123 20 First First NNP 8951 123 21 Letter Letter NNP 8951 123 22 of of IN 8951 123 23 Surname Surname NNP 8951 123 24 First First NNP 8951 123 25 and and CC 8951 123 26 Second Second NNP 8951 123 27 Letters Letters NNPS 8951 123 28 of of IN 8951 123 29 Surname Surname NNP 8951 123 30 and and CC 8951 123 31 First First NNP 8951 123 32 Initial Initial NNP 8951 123 33 Initials Initials NNPS 8951 123 34 Digram Digram NNP 8951 123 35 v v NNP 8951 123 36 Digram Digram NNP 8951 123 37 v v NNP 8951 123 38 Digram Digram NNP 8951 123 39 v v NNP 8951 123 40 KV KV NNP 8951 123 41 .064 .064 CD 8951 123 42 KV KV NNP 8951 123 43 .054 .054 CD 8951 123 44 HV HV NNP 8951 123 45 .078 .078 CD 8951 123 46 WR WR NNP 8951 123 47 .050 .050 CD 8951 123 48 HJ HJ NNP 8951 123 49 .027 .027 CD 8951 123 50 MV MV NNP 8951 123 51 .069 .069 CD 8951 123 52 KA ka NN 8951 123 53 .038 .038 CD 8951 123 54 BR BR NNP 8951 123 55 -.024 -.024 HYPH 8951 123 56 KV KV NNP 8951 123 57 .069 .069 CD 8951 123 58 HN HN NNP 8951 123 59 .028 .028 CD 8951 123 60 SJ SJ NNP 8951 123 61 -.023 -.023 CD 8951 123 62 RV RV NNP 8951 123 63 -.055 -.055 : 8951 123 64 SA sa NN 8951 123 65 .024 .024 CD 8951 123 66 DJ dj UH 8951 123 67 .022 .022 CD 8951 123 68 DV DV NNP 8951 123 69 -.053 -.053 HYPH 8951 123 70 SN SN NNP 8951 123 71 .024 .024 CD 8951 123 72 BG bg NN 8951 123 73 .018 .018 CD 8951 123 74 TV tv NN 8951 123 75 .053 .053 CD 8951 123 76 CN CN NNP 8951 123 77 .022 .022 CD 8951 123 78 KA KA NNP 8951 123 79 .018 .018 CD 8951 123 80 JV JV NNP 8951 123 81 -.045 -.045 : 8951 123 82 KN KN NNP 8951 123 83 -.020 -.020 HYPH 8951 123 84 CJ CJ NNP 8951 123 85 , , , 8951 123 86 018 018 CD 8951 123 87 SV SV NNP 8951 123 88 .034 .034 CD 8951 123 89 MA MA NNP 8951 123 90 .014 .014 CD 8951 123 91 SD sd NN 8951 123 92 .015 .015 CD 8951 123 93 FV FV NNP 8951 123 94 .033 .033 CD 8951 123 95 KR KR NNP 8951 123 96 -.011 -.011 : 8951 123 97 sv sv NNP 8951 123 98 .013 .013 CD 8951 123 99 NV NV NNP 8951 123 100 -.029 -.029 HYPH 8951 123 101 sv sv NNP 8951 123 102 , , , 8951 123 103 010 010 CD 8951 123 104 MM MM NNP 8951 123 105 .011 .011 CD 8951 123 106 GV GV NNP 8951 123 107 .022 .022 CD 8951 123 108 RN rn NN 8951 123 109 .010 .010 CD 8951 123 110 MJ MJ NNP 8951 123 111 , , , 8951 123 112 007 007 CD 8951 123 113 LV LV NNP 8951 123 114 -.022 -.022 HYPH 8951 123 115 BN BN NNP 8951 123 116 -.008 -.008 HYPH 8951 123 117 BJ BJ NNP 8951 123 118 , , , 8951 123 119 005 005 CD 8951 123 120 IV iv NN 8951 123 121 -.019 -.019 HYPH 8951 123 122 BR BR NNP 8951 123 123 .008 .008 CD 8951 123 124 SG sg NN 8951 123 125 -.004 -.004 HYPH 8951 123 126 AV AV NNP 8951 123 127 -.019 -.019 HYPH 8951 123 128 MN MN NNP 8951 123 129 -.007 -.007 : 8951 123 130 SR SR NNP 8951 123 131 .004 .004 CD 8951 123 132 CV CV NNP 8951 123 133 -.018 -.018 HYPH 8951 123 134 SR SR NNP 8951 123 135 .007 .007 CD 8951 123 136 BA BA NNP 8951 123 137 .004 .004 CD 8951 123 138 PV PV NNP 8951 123 139 .017 .017 CD 8951 123 140 MR MR NNP 8951 123 141 .004 .004 CD 8951 123 142 MA MA NNP 8951 123 143 , , , 8951 123 144 004 004 CD 8951 123 145 WV WV NNP 8951 123 146 -.014 -.014 HYPH 8951 123 147 SI SI NNP 8951 123 148 -.002 -.002 HYPH 8951 123 149 SM SM NNP 8951 123 150 -.003 -.003 -LRB- 8951 123 151 YV YV NNP 8951 123 152 .010 .010 CD 8951 123 153 GN GN NNP 8951 123 154 .001 .001 CD 8951 123 155 MR MR NNP 8951 123 156 .002 .002 CD 8951 123 157 BV BV NNP 8951 123 158 .005 .005 CD 8951 123 159 LN LN NNP 8951 123 160 .001 .001 CD 8951 123 161 SA SA NNP 8951 123 162 -.000 -.000 HYPH 8951 123 163 EV EV NNP 8951 123 164 -.002 -.002 NNP 8951 123 165 Variety Variety NNP 8951 123 166 - - HYPH 8951 123 167 Generator Generator NNP 8951 123 168 App1'0ach App1'0ach NNP 8951 123 169 / / SYM 8951 123 170 FOKKER FOKKER NNP 8951 123 171 and and CC 8951 123 172 LYNCH LYNCH NNP 8951 123 173 211 211 CD 8951 123 174 more more RBR 8951 123 175 frequent frequent JJ 8951 123 176 than than IN 8951 123 177 negative negative JJ 8951 123 178 . . . 8951 124 1 The the DT 8951 124 2 figures figure NNS 8951 124 3 indicate indicate VBP 8951 124 4 that that IN 8951 124 5 intersection intersection NN 8951 124 6 of of IN 8951 124 7 cer- cer- VBG 8951 124 8 tain tain NN 8951 124 9 of of IN 8951 124 10 these these DT 8951 124 11 characters character NNS 8951 124 12 as as IN 8951 124 13 keys key NNS 8951 124 14 in in IN 8951 124 15 search search NN 8951 124 16 would would MD 8951 124 17 result result VB 8951 124 18 in in IN 8951 124 19 some some DT 8951 124 20 slight slight JJ 8951 124 21 dimi- dimi- JJ 8951 124 22 nution nution NN 8951 124 23 in in IN 8951 124 24 performance performance NN 8951 124 25 against against IN 8951 124 26 that that DT 8951 124 27 expected expect VBN 8951 124 28 . . . 8951 125 1 The the DT 8951 125 2 figures figure NNS 8951 125 3 for for IN 8951 125 4 the the DT 8951 125 5 association association NN 8951 125 6 coefficients coefficient NNS 8951 125 7 among among IN 8951 125 8 the the DT 8951 125 9 twenty twenty CD 8951 125 10 most most JJS 8951 125 11 fre- fre- JJ 8951 125 12 quent quent JJ 8951 125 13 combinations combination NNS 8951 125 14 of of IN 8951 125 15 keys key NNS 8951 125 16 from from IN 8951 125 17 the the DT 8951 125 18 front front NN 8951 125 19 and and CC 8951 125 20 back back NN 8951 125 21 of of IN 8951 125 22 surnames surname NNS 8951 125 23 in in IN 8951 125 24 the the DT 8951 125 25 148- 148- CD 8951 125 26 and and CC 8951 125 27 296-key 296-key CD 8951 125 28 key key JJ 8951 125 29 - - HYPH 8951 125 30 sets set NNS 8951 125 31 show show NN 8951 125 32 magnitudes magnitude NNS 8951 125 33 ( ( -LRB- 8951 125 34 mostly mostly RB 8951 125 35 positive positive JJ 8951 125 36 ) ) -RRB- 8951 125 37 which which WDT 8951 125 38 are be VBP 8951 125 39 sub- sub- DT 8951 125 40 stantially stantially RB 8951 125 41 greater great JJR 8951 125 42 than than IN 8951 125 43 those those DT 8951 125 44 for for IN 8951 125 45 single single JJ 8951 125 46 characters character NNS 8951 125 47 ( ( -LRB- 8951 125 48 see see VB 8951 125 49 Table Table NNP 8951 125 50 11 11 CD 8951 125 51 ) ) -RRB- 8951 125 52 . . . 8951 126 1 The the DT 8951 126 2 rea- rea- NN 8951 126 3 sons son NNS 8951 126 4 for for IN 8951 126 5 these these DT 8951 126 6 values value NNS 8951 126 7 are be VBP 8951 126 8 obvious obvious JJ 8951 126 9 ; ; : 8951 126 10 in in IN 8951 126 11 certain certain JJ 8951 126 12 instances instance NNS 8951 126 13 , , , 8951 126 14 e.g. e.g. RB 8951 126 15 , , , 8951 126 16 MILLER MILLER NNP 8951 126 17 , , , 8951 126 18 JONES JONES NNP 8951 126 19 , , , 8951 126 20 and and CC 8951 126 21 MARTIN MARTIN NNP 8951 126 22 , , , 8951 126 23 common common JJ 8951 126 24 complete complete JJ 8951 126 25 names name NNS 8951 126 26 are be VBP 8951 126 27 apparent apparent JJ 8951 126 28 , , , 8951 126 29 while while IN 8951 126 30 in in IN 8951 126 31 one one CD 8951 126 32 case case NN 8951 126 33 , , , 8951 126 34 LEE LEE NNP 8951 126 35 , , , 8951 126 36 an an DT 8951 126 37 overlap overlap NN 8951 126 38 between between IN 8951 126 39 keys key NNS 8951 126 40 from from IN 8951 126 41 the the DT 8951 126 42 front front JJ 8951 126 43 and and CC 8951 126 44 rear rear JJ 8951 126 45 exists exist NNS 8951 126 46 . . . 8951 127 1 In in IN 8951 127 2 others other NNS 8951 127 3 , , , 8951 127 4 linguistic linguistic JJ 8951 127 5 variations variation NNS 8951 127 6 on on IN 8951 127 7 common common JJ 8951 127 8 names name NNS 8951 127 9 can can MD 8951 127 10 be be VB 8951 127 11 discerned discern VBN 8951 127 12 , , , 8951 127 13 as as IN 8951 127 14 with with IN 8951 127 15 BR BR NNP 8951 127 16 N N NNP 8951 127 17 - - : 8951 127 18 BROWN BROWN NNP 8951 127 19 or or CC 8951 127 20 BRAUN BRAUN NNP 8951 127 21 . . . 8951 128 1 Table table NN 8951 128 2 11 11 CD 8951 128 3 . . . 8951 129 1 Association Association NNP 8951 129 2 Coefficients Coefficients NNP 8951 129 3 in in IN 8951 129 4 the the DT 8951 129 5 Twenty Twenty NNP 8951 129 6 Most Most NNP 8951 129 7 Frequent frequent JJ 8951 129 8 Key Key NNP 8951 129 9 Combinations Combinations NNPS 8951 129 10 from from IN 8951 129 11 Front Front NNP 8951 129 12 and and CC 8951 129 13 Back back RB 8951 129 14 of of IN 8951 129 15 Surnames Surnames NNPS 8951 129 16 in in IN 8951 129 17 Two two CD 8951 129 18 Key Key NNP 8951 129 19 - - HYPH 8951 129 20 Sets Sets NNP 8951 129 21 Key Key NNP 8951 129 22 - - HYPH 8951 129 23 Set Set NNP 8951 129 24 Size Size NNP 8951 129 25 Key Key NNP 8951 129 26 - - HYPH 8951 129 27 Set Set NNP 8951 129 28 Size Size NNP 8951 129 29 148 148 CD 8951 129 30 296 296 CD 8951 129 31 Keys Keys NNPS 8951 129 32 v v IN 8951 129 33 Keys Keys NNP 8951 129 34 v v NNP 8951 129 35 s s NN 8951 129 36 H h NN 8951 129 37 .146 .146 CD 8951 129 38 s s NN 8951 129 39 ITH ITH NNP 8951 129 40 .343 .343 CD 8951 129 41 J J NNP 8951 129 42 SON SON NNP 8951 129 43 .127 .127 CD 8951 129 44 JO JO NNP 8951 129 45 NSON NSON NNP 8951 129 46 .297 .297 CD 8951 129 47 sc sc IN 8951 129 48 ER ER NNP 8951 129 49 .104 .104 CD 8951 129 50 JO JO NNP 8951 129 51 NES NES NNP 8951 129 52 .278 .278 NNP 8951 129 53 w w NNP 8951 129 54 s s NNP 8951 129 55 .043 .043 CD 8951 129 56 AN an DT 8951 129 57 RSON rson NN 8951 129 58 .274 .274 CD 8951 129 59 T t NN 8951 129 60 A a NN 8951 129 61 .038 .038 CD 8951 129 62 SI si NN 8951 129 63 GH gh NN 8951 129 64 .249 .249 . 8951 129 65 T t NN 8951 129 66 I i NN 8951 129 67 .038 .038 CD 8951 129 68 LE le NN 8951 129 69 EE EE NNP 8951 129 70 .221 .221 CD 8951 129 71 w w NN 8951 129 72 ER ER NNP 8951 129 73 .038 .038 CD 8951 129 74 MU mu NN 8951 129 75 LLER ller NN 8951 129 76 .214 .214 CD 8951 129 77 c c NN 8951 129 78 E e NN 8951 129 79 .034 .034 CD 8951 129 80 TA ta NN 8951 129 81 OR or CC 8951 129 82 .195 .195 CD 8951 129 83 F f NN 8951 129 84 ER ER NNP 8951 129 85 .033 .033 CD 8951 129 86 GU GU NNP 8951 129 87 TA TA NNP 8951 129 88 .168 .168 CD 8951 129 89 p p NN 8951 129 90 s s NN 8951 129 91 .025 .025 CD 8951 129 92 BR BR NNP 8951 129 93 N N NNP 8951 129 94 .160 .160 CD 8951 129 95 D d NN 8951 129 96 E e NN 8951 129 97 .023 .023 CD 8951 129 98 MI MI NNP 8951 129 99 LLER ller NN 8951 129 100 .151 .151 CD 8951 129 101 L l NN 8951 129 102 E e NN 8951 129 103 .022 .022 CD 8951 129 104 MAR MAR NNP 8951 129 105 TIN tin NN 8951 129 106 .145 .145 CD 8951 129 107 w w NN 8951 129 108 E e NN 8951 129 109 .022 .022 CD 8951 129 110 WI WI NNP 8951 129 111 s s NN 8951 129 112 .137 .137 CD 8951 129 113 G g NN 8951 129 114 IN in IN 8951 129 115 .020 .020 CD 8951 129 116 F f NN 8951 129 117 HER her PRP$ 8951 129 118 .133 .133 . 8951 129 119 M m NN 8951 129 120 E e NN 8951 129 121 .009 .009 CD 8951 129 122 sc sc IN 8951 129 123 DER DER NNP 8951 129 124 .121 .121 CD 8951 129 125 s s NN 8951 129 126 A A NNP 8951 129 127 .008 .008 CD 8951 129 128 SA sa NN 8951 129 129 TO to TO 8951 129 130 .110 .110 CD 8951 129 131 G g NN 8951 129 132 E e NN 8951 129 133 .006 .006 CD 8951 129 134 T t NN 8951 129 135 AS as IN 8951 129 136 .084 .084 CD 8951 129 137 M m NN 8951 129 138 A A NNP 8951 129 139 .005 .005 CD 8951 129 140 sc sc IN 8951 129 141 ER ER NNP 8951 129 142 .069 .069 . 8951 129 143 M m NN 8951 129 144 ER ER NNP 8951 129 145 -.004 -.004 HYPH 8951 129 146 CH CH NNP 8951 129 147 EN EN NNP 8951 129 148 .055 .055 CD 8951 129 149 G g NN 8951 129 150 ER ER NNP 8951 129 151 -.000 -.000 , 8951 129 152 T T NNP 8951 129 153 SON son NN 8951 129 154 .050 .050 CD 8951 129 155 Such such JJ 8951 129 156 associations association NNS 8951 129 157 are be VBP 8951 129 158 inevitable inevitable JJ 8951 129 159 . . . 8951 130 1 When when WRB 8951 130 2 the the DT 8951 130 3 selection selection NN 8951 130 4 of of IN 8951 130 5 keys key NNS 8951 130 6 is be VBZ 8951 130 7 based base VBN 8951 130 8 solely solely RB 8951 130 9 on on IN 8951 130 10 frequency frequency NN 8951 130 11 , , , 8951 130 12 some some DT 8951 130 13 deviation deviation NN 8951 130 14 from from IN 8951 130 15 the the DT 8951 130 16 ideal ideal NN 8951 130 17 of of IN 8951 130 18 independence independence NN 8951 130 19 must must MD 8951 130 20 result result VB 8951 130 21 , , , 8951 130 22 becoming become VBG 8951 130 23 larger large JJR 8951 130 24 as as IN 8951 130 25 the the DT 8951 130 26 size size NN 8951 130 27 of of IN 8951 130 28 the the DT 8951 130 29 key key JJ 8951 130 30 - - HYPH 8951 130 31 sets set NNS 8951 130 32 increases increase NNS 8951 130 33 , , , 8951 130 34 and and CC 8951 130 35 as as IN 8951 130 36 the the DT 8951 130 37 length length NN 8951 130 38 of of IN 8951 130 39 certain certain JJ 8951 130 40 of of IN 8951 130 41 the the DT 8951 130 42 keys key NNS 8951 130 43 increases increase NNS 8951 130 44 . . . 8951 131 1 However however RB 8951 131 2 , , , 8951 131 3 since since IN 8951 131 4 its -PRON- PRP$ 8951 131 5 effect effect NN 8951 131 6 in in IN 8951 131 7 the the DT 8951 131 8 most most RBS 8951 131 9 extreme extreme JJ 8951 131 10 cases case NNS 8951 131 11 is be VBZ 8951 131 12 merely merely RB 8951 131 13 to to TO 8951 131 14 lead lead VB 8951 131 15 to to IN 8951 131 16 virtually virtually RB 8951 131 17 exact exact JJ 8951 131 18 definition definition NN 8951 131 19 of of IN 8951 131 20 the the DT 8951 131 21 most most RBS 8951 131 22 fre- fre- JJ 8951 131 23 quent quent NNP 8951 131 24 surnames surname NNS 8951 131 25 , , , 8951 131 26 no no DT 8951 131 27 particular particular JJ 8951 131 28 disadvantage disadvantage NN 8951 131 29 results result NNS 8951 131 30 . . . 8951 132 1 POSSIBLE possible RB 8951 132 2 IMPLEMENTATIONS implementations JJ 8951 132 3 OF of IN 8951 132 4 THE the DT 8951 132 5 VARIETY VARIETY NNP 8951 132 6 - - HYPH 8951 132 7 GENERATOR GENERATOR NNP 8951 132 8 NAME NAME NNP 8951 132 9 SEARCH SEARCH NNS 8951 132 10 APPROACH approach VBP 8951 132 11 The the DT 8951 132 12 variety variety NN 8951 132 13 - - HYPH 8951 132 14 generator generator NN 8951 132 15 approach approach NN 8951 132 16 permits permit VBZ 8951 132 17 a a DT 8951 132 18 number number NN 8951 132 19 of of IN 8951 132 20 possible possible JJ 8951 132 21 implemen- implemen- JJ 8951 132 22 tations tation NNS 8951 132 23 of of IN 8951 132 24 searches search NNS 8951 132 25 for for IN 8951 132 26 personal personal JJ 8951 132 27 names name NNS 8951 132 28 to to TO 8951 132 29 be be VB 8951 132 30 considered consider VBN 8951 132 31 , , , 8951 132 32 if if IN 8951 132 33 only only RB 8951 132 34 in in IN 8951 132 35 outline outline NNP 8951 132 36 f f NNP 8951 132 37 ( ( -LRB- 8951 132 38 f•j/ f•j/ NNP 8951 132 39 212 212 CD 8951 132 40 Journal Journal NNP 8951 132 41 of of IN 8951 132 42 Library Library NNP 8951 132 43 Automation Automation NNP 8951 132 44 Vol Vol NNP 8951 132 45 . . . 8951 133 1 7/3 7/3 CD 8951 133 2 September September NNP 8951 133 3 1974 1974 CD 8951 133 4 at at IN 8951 133 5 this this DT 8951 133 6 stage stage NN 8951 133 7 , , , 8951 133 8 using use VBG 8951 133 9 a a DT 8951 133 10 variety variety NN 8951 133 11 of of IN 8951 133 12 file file NN 8951 133 13 organization organization NN 8951 133 14 methods method NNS 8951 133 15 . . . 8951 134 1 The the DT 8951 134 2 most most RBS 8951 134 3 widely widely RB 8951 134 4 known know VBN 8951 134 5 methods method NNS 8951 134 6 ( ( -LRB- 8951 134 7 apart apart RB 8951 134 8 from from IN 8951 134 9 purely purely RB 8951 134 10 sequential sequential JJ 8951 134 11 files file NNS 8951 134 12 ) ) -RRB- 8951 134 13 are be VBP 8951 134 14 direct direct JJ 8951 134 15 access access NN 8951 134 16 ( ( -LRB- 8951 134 17 uti- uti- RB 8951 134 18 lizing lize VBG 8951 134 19 hash hash NN 8951 134 20 - - HYPH 8951 134 21 addressing addressing NN 8951 134 22 ) ) -RRB- 8951 134 23 , , , 8951 134 24 chained chain VBN 8951 134 25 , , , 8951 134 26 and and CC 8951 134 27 index index NN 8951 134 28 sequential sequential JJ 8951 134 29 files file NNS 8951 134 30 . . . 8951 135 1 Direct direct JJ 8951 135 2 application application NN 8951 135 3 of of IN 8951 135 4 the the DT 8951 135 5 concatenated concatenate VBN 8951 135 6 key key JJ 8951 135 7 - - HYPH 8951 135 8 numbers number NNS 8951 135 9 as as IN 8951 135 10 the the DT 8951 135 11 basis basis NN 8951 135 12 for for IN 8951 135 13 hash hash NN 8951 135 14 - - HYPH 8951 135 15 address address NN 8951 135 16 computation computation NN 8951 135 17 appears appear VBZ 8951 135 18 attractive attractive JJ 8951 135 19 in in IN 8951 135 20 instances instance NNS 8951 135 21 where where WRB 8951 135 22 the the DT 8951 135 23 person- person- NN 8951 135 24 al al NNP 8951 135 25 name name NNP 8951 135 26 is be VBZ 8951 135 27 used use VBN 8951 135 28 alone alone RB 8951 135 29 or or CC 8951 135 30 in in IN 8951 135 31 combination combination NN 8951 135 32 ( ( -LRB- 8951 135 33 as as IN 8951 135 34 , , , 8951 135 35 for for IN 8951 135 36 instance instance NN 8951 135 37 , , , 8951 135 38 with with IN 8951 135 39 a a DT 8951 135 40 part part NN 8951 135 41 of of IN 8951 135 42 the the DT 8951 135 43 document document NN 8951 135 44 title title NN 8951 135 45 ) ) -RRB- 8951 135 46 . . . 8951 136 1 The the DT 8951 136 2 almost almost RB 8951 136 3 random random JJ 8951 136 4 distribution distribution NN 8951 136 5 of of IN 8951 136 6 the the DT 8951 136 7 bits bit NNS 8951 136 8 in in IN 8951 136 9 this this DT 8951 136 10 code code NN 8951 136 11 should should MD 8951 136 12 result result VB 8951 136 13 in in IN 8951 136 14 a a DT 8951 136 15 general general JJ 8951 136 16 diminution diminution NN 8951 136 17 of of IN 8951 136 18 the the DT 8951 136 19 collision collision NN 8951 136 20 and and CC 8951 136 21 overflow overflow NN 8951 136 22 problems problem NNS 8951 136 23 commonly commonly RB 8951 136 24 encountered encounter VBN 8951 136 25 with with IN 8951 136 26 fixed fix VBN 8951 136 27 - - HYPH 8951 136 28 length length NN 8951 136 29 keys key NNS 8951 136 30 . . . 8951 137 1 Since since IN 8951 137 2 only only RB 8951 137 3 four four CD 8951 137 4 keys key NNS 8951 137 5 are be VBP 8951 137 6 used use VBN 8951 137 7 to to TO 8951 137 8 represent represent VB 8951 137 9 each each DT 8951 137 10 name name NN 8951 137 11 , , , 8951 137 12 and and CC 8951 137 13 the the DT 8951 137 14 four four CD 8951 137 15 sets set NNS 8951 137 16 of of IN 8951 137 17 keys key NNS 8951 137 18 from from IN 8951 137 19 which which WDT 8951 137 20 these these DT 8951 137 21 are be VBP 8951 137 22 selected select VBN 8951 137 23 are be VBP 8951 137 24 limited limit VBN 8951 137 25 in in IN 8951 137 26 number number NN 8951 137 27 and and CC 8951 137 28 of of IN 8951 137 29 ap- ap- NNP 8951 137 30 proximately proximately RB 8951 137 31 equal equal JJ 8951 137 32 probability probability NN 8951 137 33 , , , 8951 137 34 the the DT 8951 137 35 keys key NNS 8951 137 36 can can MD 8951 137 37 be be VB 8951 137 38 used use VBN 8951 137 39 to to TO 8951 137 40 construct construct VB 8951 137 41 chained chained JJ 8951 137 42 indexes index NNS 8951 137 43 , , , 8951 137 44 to to TO 8951 137 45 which which WDT 8951 137 46 , , , 8951 137 47 however however RB 8951 137 48 , , , 8951 137 49 the the DT 8951 137 50 usual usual JJ 8951 137 51 constraints constraint NNS 8951 137 52 still still RB 8951 137 53 apply apply VBP 8951 137 54 . . . 8951 138 1 Index index NN 8951 138 2 sequential sequential JJ 8951 138 3 storage storage NN 8951 138 4 again again RB 8951 138 5 offers offer VBZ 8951 138 6 opportunities opportunity NNS 8951 138 7 , , , 8951 138 8 in in IN 8951 138 9 particular particular JJ 8951 138 10 since since IN 8951 138 11 the the DT 8951 138 12 low low JJ 8951 138 13 variety variety NN 8951 138 14 of of IN 8951 138 15 key key JJ 8951 138 16 types type NNS 8951 138 17 means mean VBZ 8951 138 18 that that IN 8951 138 19 the the DT 8951 138 20 sorting sort VBG 8951 138 21 operations operation NNS 8951 138 22 which which WDT 8951 138 23 this this DT 8951 138 24 entails entail NNS 8951 138 25 can can MD 8951 138 26 be be VB 8951 138 27 eliminated eliminate VBN 8951 138 28 . . . 8951 139 1 In in IN 8951 139 2 effect effect NN 8951 139 3 , , , 8951 139 4 each each DT 8951 139 5 name name NN 8951 139 6 entry entry NN 8951 139 7 would would MD 8951 139 8 be be VB 8951 139 9 represented represent VBN 8951 139 10 by by IN 8951 139 11 an an DT 8951 139 12 entry entry NN 8951 139 13 in in IN 8951 139 14 each each DT 8951 139 15 of of IN 8951 139 16 four four CD 8951 139 17 lists list NNS 8951 139 18 of of IN 8951 139 19 document document NN 8951 139 20 numbers number NNS 8951 139 21 or or CC 8951 139 22 addresses address NNS 8951 139 23 , , , 8951 139 24 and and CC 8951 139 25 documents document NNS 8951 139 26 retrieved retrieve VBN 8951 139 27 by by IN 8951 139 28 intersection intersection NN 8951 139 29 of of IN 8951 139 30 the the DT 8951 139 31 lists list NNS 8951 139 32 . . . 8951 140 1 While while IN 8951 140 2 four four CD 8951 140 3 such such JJ 8951 140 4 numbers number NNS 8951 140 5 are be VBP 8951 140 6 stored store VBN 8951 140 7 for for IN 8951 140 8 each each DT 8951 140 9 name name NN 8951 140 10 , , , 8951 140 11 in in IN 8951 140 12 contrast contrast NN 8951 140 13 to to IN 8951 140 14 a a DT 8951 140 15 single single JJ 8951 140 16 entry entry NN 8951 140 17 for for IN 8951 140 18 the the DT 8951 140 19 more more JJR 8951 140 20 con- con- NNP 8951 140 21 ventional ventional NNP 8951 140 22 name name NN 8951 140 23 list list NN 8951 140 24 , , , 8951 140 25 the the DT 8951 140 26 removal removal NN 8951 140 27 of of IN 8951 140 28 the the DT 8951 140 29 name name NN 8951 140 30 list list NN 8951 140 31 itself -PRON- PRP 8951 140 32 would would MD 8951 140 33 more more RBR 8951 140 34 than than IN 8951 140 35 compensate compensate VB 8951 140 36 for for IN 8951 140 37 the the DT 8951 140 38 additional additional JJ 8951 140 39 storage storage NN 8951 140 40 required require VBN 8951 140 41 for for IN 8951 140 42 the the DT 8951 140 43 lists list NNS 8951 140 44 . . . 8951 141 1 In in IN 8951 141 2 the the DT 8951 141 3 index index NN 8951 141 4 sequential sequential JJ 8951 141 5 mode mode NN 8951 141 6 , , , 8951 141 7 the the DT 8951 141 8 lists list NNS 8951 141 9 of of IN 8951 141 10 document document NN 8951 141 11 addresses address NNS 8951 141 12 or or CC 8951 141 13 num- num- JJ 8951 141 14 bers ber NNS 8951 141 15 stored store VBN 8951 141 16 with with IN 8951 141 17 each each DT 8951 141 18 key key NN 8951 141 19 are be VBP 8951 141 20 more more RBR 8951 141 21 or or CC 8951 141 22 less less RBR 8951 141 23 equally equally RB 8951 141 24 long long JJ 8951 141 25 . . . 8951 142 1 They -PRON- PRP 8951 142 2 may may MD 8951 142 3 thus thus RB 8951 142 4 be be VB 8951 142 5 replaced replace VBN 8951 142 6 by by IN 8951 142 7 bit bit NN 8951 142 8 - - HYPH 8951 142 9 vectors vector NNS 8951 142 10 in in IN 8951 142 11 which which WDT 8951 142 12 the the DT 8951 142 13 position position NN 8951 142 14 of of IN 8951 142 15 a a DT 8951 142 16 bit bit NN 8951 142 17 corresponds correspond VBZ 8951 142 18 to to IN 8951 142 19 a a DT 8951 142 20 name name NN 8951 142 21 or or CC 8951 142 22 document document NN 8951 142 23 number number NN 8951 142 24 . . . 8951 143 1 If if IN 8951 143 2 the the DT 8951 143 3 number number NN 8951 143 4 of of IN 8951 143 5 keys key NNS 8951 143 6 bears bear VBZ 8951 143 7 a a DT 8951 143 8 simple simple JJ 8951 143 9 relation relation NN 8951 143 10 to to IN 8951 143 11 the the DT 8951 143 12 number number NN 8951 143 13 of of IN 8951 143 14 blocks block NNS 8951 143 15 on on IN 8951 143 16 a a DT 8951 143 17 disc disc NN 8951 143 18 cylinder cylinder NN 8951 143 19 , , , 8951 143 20 the the DT 8951 143 21 vectors vector NNS 8951 143 22 can can MD 8951 143 23 be be VB 8951 143 24 stored store VBN 8951 143 25 in in IN 8951 143 26 pre- pre- RB 8951 143 27 determined determine VBN 8951 143 28 positions position NNS 8951 143 29 within within IN 8951 143 30 a a DT 8951 143 31 cylinder cylinder NN 8951 143 32 , , , 8951 143 33 resulting result VBG 8951 143 34 in in IN 8951 143 35 the the DT 8951 143 36 serial serial JJ 8951 143 37 - - HYPH 8951 143 38 parallel parallel NN 8951 143 39 file file NN 8951 143 40 . . . 8951 144 1 The the DT 8951 144 2 usefulness usefulness NN 8951 144 3 of of IN 8951 144 4 this this DT 8951 144 5 file file NN 8951 144 6 organization organization NN 8951 144 7 has have VBZ 8951 144 8 yet yet RB 8951 144 9 to to TO 8951 144 10 be be VB 8951 144 11 fully fully RB 8951 144 12 evaluated evaluate VBN 8951 144 13 ; ; : 8951 144 14 however however RB 8951 144 15 , , , 8951 144 16 it -PRON- PRP 8951 144 17 also also RB 8951 144 18 promises promise VBZ 8951 144 19 substantial substantial JJ 8951 144 20 economies economy NNS 8951 144 21 in in IN 8951 144 22 storage storage NN 8951 144 23 . . . 8951 145 1 On on IN 8951 145 2 average average JJ 8951 145 3 , , , 8951 145 4 only only RB 8951 145 5 four four CD 8951 145 6 of of IN 8951 145 7 the the DT 8951 145 8 bits bit NNS 8951 145 9 are be VBP 8951 145 10 set set VBN 8951 145 11 at at IN 8951 145 12 the the DT 8951 145 13 positions position NNS 8951 145 14 in in IN 8951 145 15 the the DT 8951 145 16 vectors vector NNS 8951 145 17 corresponding correspond VBG 8951 145 18 to to IN 8951 145 19 the the DT 8951 145 20 name name NN 8951 145 21 or or CC 8951 145 22 document document NN 8951 145 23 entry entry NN 8951 145 24 . . . 8951 146 1 On on IN 8951 146 2 average average JJ 8951 146 3 , , , 8951 146 4 then then RB 8951 146 5 , , , 8951 146 6 the the DT 8951 146 7 density density NN 8951 146 8 of of IN 8951 146 9 1-bits 1-bits CD 8951 146 10 is be VBZ 8951 146 11 very very RB 8951 146 12 low low JJ 8951 146 13 , , , 8951 146 14 and and CC 8951 146 15 long long JJ 8951 146 16 runs run NNS 8951 146 17 of of IN 8951 146 18 zeros zero NNS 8951 146 19 occur occur VBP 8951 146 20 in in IN 8951 146 21 the the DT 8951 146 22 vectors vector NNS 8951 146 23 . . . 8951 147 1 They -PRON- PRP 8951 147 2 can can MD 8951 147 3 , , , 8951 147 4 therefore therefore RB 8951 147 5 , , , 8951 147 6 be be VB 8951 147 7 compressed compress VBN 8951 147 8 using use VBG 8951 147 9 run run VBN 8951 147 10 - - HYPH 8951 147 11 length length NN 8951 147 12 coding coding NN 8951 147 13 , , , 8951 147 14 for for IN 8951 147 15 instance instance NN 8951 147 16 as as IN 8951 147 17 applied apply VBN 8951 147 18 by by IN 8951 147 19 Brad- Brad- NNP 8951 147 20 ley.3 ley.3 NNP 8951 147 21 · · NFP 8951 147 22 4 4 CD 8951 147 23 Preliminary preliminary JJ 8951 147 24 work work NN 8951 147 25 with with IN 8951 147 26 the the DT 8951 147 27 296-key 296-key CD 8951 147 28 key key NN 8951 147 29 - - HYPH 8951 147 30 set set NN 8951 147 31 has have VBZ 8951 147 32 indicated indicate VBN 8951 147 33 already already RB 8951 147 34 that that IN 8951 147 35 a a DT 8951 147 36 gross gross JJ 8951 147 37 compression compression NN 8951 147 38 ratio ratio NN 8951 147 39 of of IN 8951 147 40 nine nine CD 8951 147 41 to to IN 8951 147 42 one one CD 8951 147 43 is be VBZ 8951 147 44 attainable attainable JJ 8951 147 45 , , , 8951 147 46 so so IN 8951 147 47 that that IN 8951 147 48 the the DT 8951 147 49 explicit explicit JJ 8951 147 50 storage storage NN 8951 147 51 requirements requirement NNS 8951 147 52 to to TO 8951 147 53 identify identify VB 8951 147 54 the the DT 8951 147 55 association association NN 8951 147 56 between between IN 8951 147 57 a a DT 8951 147 58 name name NN 8951 147 59 and and CC 8951 147 60 a a DT 8951 147 61 document document NN 8951 147 62 number number NN 8951 147 63 would would MD 8951 147 64 be be VB 8951 147 65 just just RB 8951 147 66 over over IN 8951 147 67 thirty thirty CD 8951 147 68 bits bit NNS 8951 147 69 . . . 8951 148 1 CONCLUSIONS conclusion NNS 8951 148 2 The the DT 8951 148 3 work work NN 8951 148 4 described describe VBN 8951 148 5 here here RB 8951 148 6 relates relate VBZ 8951 148 7 solely solely RB 8951 148 8 to to IN 8951 148 9 searches search NNS 8951 148 10 for for IN 8951 148 11 individual individual JJ 8951 148 12 occur- occur- NN 8951 148 13 rences rence NNS 8951 148 14 of of IN 8951 148 15 personal personal JJ 8951 148 16 names name NNS 8951 148 17 . . . 8951 149 1 Clearly clearly RB 8951 149 2 , , , 8951 149 3 in in IN 8951 149 4 operational operational JJ 8951 149 5 systems system NNS 8951 149 6 in in IN 8951 149 7 which which WDT 8951 149 8 one one CD 8951 149 9 or or CC 8951 149 10 more more JJR 8951 149 11 author author NN 8951 149 12 names name NNS 8951 149 13 are be VBP 8951 149 14 associated associate VBN 8951 149 15 with with IN 8951 149 16 a a DT 8951 149 17 particular particular JJ 8951 149 18 bibliographical bibliographical JJ 8951 149 19 item item NN 8951 149 20 , , , 8951 149 21 it -PRON- PRP 8951 149 22 will will MD 8951 149 23 be be VB 8951 149 24 necessary necessary JJ 8951 149 25 to to TO 8951 149 26 provide provide VB 8951 149 27 for for IN 8951 149 28 description description NN 8951 149 29 of of IN 8951 149 30 each each DT 8951 149 31 of of IN 8951 149 32 these these DT 8951 149 33 for for IN 8951 149 34 access access NN 8951 149 35 . . . 8951 150 1 If if IN 8951 150 2 this this DT 8951 150 3 is be VBZ 8951 150 4 provided provide VBN 8951 150 5 solely solely RB 8951 150 6 on on IN 8951 150 7 the the DT 8951 150 8 basis basis NN 8951 150 9 of of IN 8951 150 10 a a DT 8951 150 11 document document NN 8951 150 12 number number NN 8951 150 13 , , , 8951 150 14 some some DT 8951 150 15 false false JJ 8951 150 16 coordination coordination NN 8951 150 17 will will MD 8951 150 18 occur occur VB 8951 150 19 - - : 8951 150 20 for for RP 8951 150 21 instance instance NN 8951 150 22 , , , 8951 150 23 when when WRB 8951 150 24 the the DT 8951 150 25 initials initial NNS 8951 150 26 of of IN 8951 150 27 one one CD 8951 150 28 entry entry NN 8951 150 29 are be VBP 8951 150 30 Variety Variety NNP 8951 150 31 - - HYPH 8951 150 32 Generator Generator NNP 8951 150 33 Approach Approach NNP 8951 150 34 / / SYM 8951 150 35 FOKKER FOKKER NNP 8951 150 36 and and CC 8951 150 37 LYNCH LYNCH NNP 8951 150 38 213 213 CD 8951 150 39 combined combine VBN 8951 150 40 with with IN 8951 150 41 the the DT 8951 150 42 surname surname NN 8951 150 43 of of IN 8951 150 44 another another DT 8951 150 45 . . . 8951 151 1 A a DT 8951 151 2 number number NN 8951 151 3 of of IN 8951 151 4 strategies strategy NNS 8951 151 5 can can MD 8951 151 6 be be VB 8951 151 7 en- en- RB 8951 151 8 visaged visage VBN 8951 151 9 to to TO 8951 151 10 overcome overcome VB 8951 151 11 this this DT 8951 151 12 problem problem NN 8951 151 13 . . . 8951 152 1 , , , 8951 152 2 The the DT 8951 152 3 performance performance NN 8951 152 4 figures figure NNS 8951 152 5 show show VBP 8951 152 6 clearly clearly RB 8951 152 7 that that IN 8951 152 8 a a DT 8951 152 9 small small JJ 8951 152 10 number number NN 8951 152 11 of of IN 8951 152 12 character- character- JJ 8951 152 13 istics istic NNS 8951 152 14 - - , 8951 152 15 between between IN 8951 152 16 100 100 CD 8951 152 17 and and CC 8951 152 18 300 300 CD 8951 152 19 in in IN 8951 152 20 this this DT 8951 152 21 study study NN 8951 152 22 - - , 8951 152 23 are be VBP 8951 152 24 sufficient sufficient JJ 8951 152 25 to to TO 8951 152 26 characterize characterize VB 8951 152 27 the the DT 8951 152 28 entries entry NNS 8951 152 29 in in IN 8951 152 30 large large JJ 8951 152 31 files file NNS 8951 152 32 of of IN 8951 152 33 personal personal JJ 8951 152 34 names name NNS 8951 152 35 and and CC 8951 152 36 to to TO 8951 152 37 provide provide VB 8951 152 38 a a DT 8951 152 39 high high JJ 8951 152 40 degree degree NN 8951 152 41 of of IN 8951 152 42 resolution resolution NN 8951 152 43 in in IN 8951 152 44 searches search NNS 8951 152 45 for for IN 8951 152 46 them -PRON- PRP 8951 152 47 . . . 8951 153 1 While while IN 8951 153 2 performance performance NN 8951 153 3 in in IN 8951 153 4 much much RB 8951 153 5 larger large JJR 8951 153 6 files file NNS 8951 153 7 , , , 8951 153 8 involving involve VBG 8951 153 9 the the DT 8951 153 10 extension extension NN 8951 153 11 of of IN 8951 153 12 key key JJ 8951 153 13 - - HYPH 8951 153 14 set set NN 8951 153 15 sizes size NNS 8951 153 16 to to IN 8951 153 17 larger large JJR 8951 153 18 munbers munber NNS 8951 153 19 , , , 8951 153 20 has have VBZ 8951 153 21 yet yet RB 8951 153 22 to to TO 8951 153 23 be be VB 8951 153 24 studied study VBN 8951 153 25 , , , 8951 153 26 the the DT 8951 153 27 logical logical JJ 8951 153 28 application application NN 8951 153 29 of of IN 8951 153 30 the the DT 8951 153 31 concept concept NN 8951 153 32 of of IN 8951 153 33 variety variety NN 8951 153 34 generation generation NN 8951 153 35 would would MD 8951 153 36 appear appear VB 8951 153 37 to to TO 8951 153 38 open open VB 8951 153 39 the the DT 8951 153 40 way way NN 8951 153 41 to to IN 8951 153 42 novel novel JJ 8951 153 43 approaches approach NNS 8951 153 44 to to IN 8951 153 45 searches search NNS 8951 153 46 for for IN 8951 153 47 documents document NNS 8951 153 48 as- as- XX 8951 153 49 sociated sociate VBD 8951 153 50 with with IN 8951 153 51 particular particular JJ 8951 153 52 personal personal JJ 8951 153 53 names name NNS 8951 153 54 , , , 8951 153 55 which which WDT 8951 153 56 seem seem VBP 8951 153 57 likely likely JJ 8951 153 58 to to TO 8951 153 59 offer offer VB 8951 153 60 ad- ad- CC 8951 153 61 vantages vantage NNS 8951 153 62 in in IN 8951 153 63 terms term NNS 8951 153 64 of of IN 8951 153 65 the the DT 8951 153 66 overall overall JJ 8951 153 67 economic economic JJ 8951 153 68 performance performance NN 8951 153 69 of of IN 8951 153 70 search search NN 8951 153 71 systems system NNS 8951 153 72 , , , 8951 153 73 not not RB 8951 153 74 only only RB 8951 153 75 in in IN 8951 153 76 bibliographic bibliographic JJ 8951 153 77 but but CC 8951 153 78 also also RB 8951 153 79 in in IN 8951 153 80 more more JJR 8951 153 81 general general JJ 8951 153 82 computer computer NN 8951 153 83 - - HYPH 8951 153 84 based base VBN 8951 153 85 infor- infor- NNP 8951 153 86 mation mation NN 8951 153 87 systems system NNS 8951 153 88 . . . 8951 154 1 ACKNOWLEDGMENTS acknowledgment NNS 8951 154 2 We -PRON- PRP 8951 154 3 thank thank VBP 8951 154 4 M. M. NNP 8951 154 5 D. D. NNP 8951 154 6 Martin Martin NNP 8951 154 7 of of IN 8951 154 8 the the DT 8951 154 9 Institution Institution NNP 8951 154 10 of of IN 8951 154 11 Electrical Electrical NNP 8951 154 12 Engineers Engineers NNPS 8951 154 13 for for IN 8951 154 14 provision provision NN 8951 154 15 of of IN 8951 154 16 a a DT 8951 154 17 part part NN 8951 154 18 of of IN 8951 154 19 the the DT 8951 154 20 INSPEC INSPEC NNP 8951 154 21 data data NN 8951 154 22 base base NN 8951 154 23 and and CC 8951 154 24 of of IN 8951 154 25 file file NN 8951 154 26 - - HYPH 8951 154 27 handling handle VBG 8951 154 28 soft- soft- NN 8951 154 29 ware ware NN 8951 154 30 , , , 8951 154 31 and and CC 8951 154 32 the the DT 8951 154 33 Potchefstroom Potchefstroom NNP 8951 154 34 University University NNP 8951 154 35 for for IN 8951 154 36 C.H.E. C.H.E. NNP 8951 155 1 ( ( -LRB- 8951 155 2 South South NNP 8951 155 3 Mrica Mrica NNP 8951 155 4 ) ) -RRB- 8951 155 5 for for IN 8951 155 6 awarding award VBG 8951 155 7 a a DT 8951 155 8 National National NNP 8951 155 9 Grant Grant NNP 8951 155 10 to to IN 8951 155 11 D. D. NNP 8951 155 12 Fokker Fokker NNP 8951 155 13 to to TO 8951 155 14 pursue pursue VB 8951 155 15 this this DT 8951 155 16 work work NN 8951 155 17 . . . 8951 156 1 We -PRON- PRP 8951 156 2 also also RB 8951 156 3 thank thank VBP 8951 156 4 Dr. Dr. NNP 8951 157 1 I. I. NNP 8951 157 2 J. J. NNP 8951 157 3 Barton Barton NNP 8951 157 4 and and CC 8951 157 5 Dr. Dr. NNP 8951 157 6 G. G. NNP 8951 157 7 W. W. NNP 8951 157 8 Adamson Adamson NNP 8951 157 9 for for IN 8951 157 10 valuable valuable JJ 8951 157 11 discussions discussion NNS 8951 157 12 , , , 8951 157 13 and and CC 8951 157 14 the the DT 8951 157 15 former former JJ 8951 157 16 for for IN 8951 157 17 n n JJ 8951 157 18 - - HYPH 8951 157 19 gram gram NN 8951 157 20 generation generation NN 8951 157 21 programs program NNS 8951 157 22 . . . 8951 158 1 REFERENCES reference NNS 8951 158 2 1 1 CD 8951 158 3 . . . 8951 159 1 D. D. NNP 8951 159 2 W. W. NNP 8951 159 3 Fokker Fokker NNP 8951 159 4 and and CC 8951 159 5 M. M. NNP 8951 159 6 F. F. NNP 8951 159 7 Lynch Lynch NNP 8951 159 8 , , , 8951 159 9 " " `` 8951 159 10 Application application NN 8951 159 11 of of IN 8951 159 12 the the DT 8951 159 13 Variety Variety NNP 8951 159 14 - - HYPH 8951 159 15 Generator Generator NNP 8951 159 16 Approach Approach NNP 8951 159 17 to to IN 8951 159 18 Searches search NNS 8951 159 19 of of IN 8951 159 20 Personal Personal NNP 8951 159 21 Names Names NNPS 8951 159 22 in in IN 8951 159 23 Bibliographic Bibliographic NNP 8951 159 24 Data Data NNP 8951 159 25 Bases Bases NNPS 8951 159 26 - - HYPH 8951 159 27 Part Part NNP 8951 159 28 1 1 CD 8951 159 29 . . . 8951 160 1 Microstructure microstructure NN 8951 160 2 of of IN 8951 160 3 Personal Personal NNP 8951 160 4 Authors Authors NNPS 8951 160 5 ' ' POS 8951 160 6 Names name NNS 8951 160 7 , , , 8951 160 8 " " '' 8951 160 9 Journal Journal NNP 8951 160 10 of of IN 8951 160 11 Library Library NNP 8951 160 12 Automation Automation NNP 8951 160 13 7:105 7:105 NNPS 8951 160 14 - - SYM 8951 160 15 18 18 CD 8951 160 16 ( ( -LRB- 8951 160 17 June June NNP 8951 160 18 1974 1974 CD 8951 160 19 ) ) -RRB- 8951 160 20 . . . 8951 161 1 2 2 LS 8951 161 2 . . . 8951 162 1 I. I. NNP 8951 162 2 J. J. NNP 8951 162 3 Barton Barton NNP 8951 162 4 , , , 8951 162 5 S. S. NNP 8951 162 6 E. E. NNP 8951 162 7 Creasey Creasey NNP 8951 162 8 , , , 8951 162 9 M. M. NNP 8951 162 10 F. F. NNP 8951 162 11 Lynch Lynch NNP 8951 162 12 , , , 8951 162 13 and and CC 8951 162 14 M. M. NNP 8951 162 15 J. J. NNP 8951 162 16 Snell Snell NNP 8951 162 17 , , , 8951 162 18 " " `` 8951 162 19 An an DT 8951 162 20 Information information NN 8951 162 21 - - HYPH 8951 162 22 Theoretic theoretic JJ 8951 162 23 Approach Approach NNP 8951 162 24 to to IN 8951 162 25 Text Text NNP 8951 162 26 Searching Searching NNP 8951 162 27 in in IN 8951 162 28 Direct Direct NNP 8951 162 29 - - HYPH 8951 162 30 Access Access NNP 8951 162 31 Systems Systems NNPS 8951 162 32 , , , 8951 162 33 " " '' 8951 162 34 Communications communication NNS 8951 162 35 of of IN 8951 162 36 the the DT 8951 162 37 ACM ACM NNP 8951 162 38 ( ( -LRB- 8951 162 39 in in IN 8951 162 40 press press NN 8951 162 41 ) ) -RRB- 8951 162 42 . . . 8951 163 1 3 3 LS 8951 163 2 . . . 8951 164 1 S. S. NNP 8951 164 2 D. D. NNP 8951 164 3 Bradley Bradley NNP 8951 164 4 , , , 8951 164 5 " " `` 8951 164 6 Optimizing optimize VBG 8951 164 7 a a DT 8951 164 8 Scheme Scheme NNP 8951 164 9 for for IN 8951 164 10 Run Run NNP 8951 164 11 - - HYPH 8951 164 12 Length Length NNP 8951 164 13 Encoding encoding NN 8951 164 14 , , , 8951 164 15 " " '' 8951 164 16 Proceedings proceeding NNS 8951 164 17 of of IN 8951 164 18 the the DT 8951 164 19 IEEE IEEE NNP 8951 164 20 57:108 57:108 CD 8951 164 21 - - SYM 8951 164 22 9 9 CD 8951 164 23 ( ( -LRB- 8951 164 24 1969 1969 CD 8951 164 25 ) ) -RRB- 8951 164 26 . . . 8951 165 1 4 4 LS 8951 165 2 . . . 8951 166 1 M. M. NNP 8951 166 2 F. F. NNP 8951 166 3 Lynch Lynch NNP 8951 166 4 , , , 8951 166 5 " " `` 8951 166 6 Compression compression NN 8951 166 7 of of IN 8951 166 8 Bibliographic Bibliographic NNP 8951 166 9 Files Files NNP 8951 166 10 Using use VBG 8951 166 11 an an DT 8951 166 12 Adaptation adaptation NN 8951 166 13 of of IN 8951 166 14 Run- Run- NNP 8951 166 15 Length Length NNP 8951 166 16 Coding Coding NNP 8951 166 17 , , , 8951 166 18 " " `` 8951 166 19 Information Information NNP 8951 166 20 Storage Storage NNP 8951 166 21 and and CC 8951 166 22 Retrieval Retrieval NNP 8951 166 23 9:207 9:207 CD 8951 166 24 - - HYPH 8951 166 25 14 14 CD 8951 166 26 ( ( -LRB- 8951 166 27 1973 1973 CD 8951 166 28 ) ) -RRB- 8951 166 29 . . . 8951 167 1 r r LS 8951 167 2 / / SYM 8951 167 3 ' ' '' 8951 167 4 I -PRON- PRP 8951 167 5 , , ,