Summary of your 'study carrel' ============================== This is a summary of your Distant Reader 'study carrel'. The Distant Reader harvested & cached your content into a collection/corpus. It then applied sets of natural language processing and text mining against the collection. The results of this process was reduced to a database file -- a 'study carrel'. The study carrel can then be queried, thus bringing light specific characteristics for your collection. These characteristics can help you summarize the collection as well as enumerate things you might want to investigate more closely. This report is a terse narrative report, and when processing is complete you will be linked to a more complete narrative report. Eric Lease Morgan Number of items in the collection; 'How big is my corpus?' ---------------------------------------------------------- 57 Average length of all items measured in words; "More or less, how big is each item?" ------------------------------------------------------------------------------------ 4925 Average readability score of all items (0 = difficult; 100 = easy) ------------------------------------------------------------------ 68 Top 50 statistically significant keywords; "What is my collection about?" ------------------------------------------------------------------------- 40 Library 11 MARC 10 University 7 system 6 card 6 IBM 5 record 5 order 4 catalog 3 title 3 cost 3 book 3 Libraries 2 program 2 list 2 file 2 field 2 entry 2 RECON 2 Information 2 File 2 Department 2 Card 1 table 1 subrecord 1 storage 1 place 1 painting 1 objective 1 number 1 module 1 loan 1 line 1 library 1 letter 1 hour 1 format 1 error 1 document 1 designation 1 content 1 computer 1 code 1 century 1 automation 1 author 1 area 1 algorithm 1 TAG 1 Stanford Top 50 lemmatized nouns; "What is discussed?" --------------------------------------------- 1380 system 1331 record 1234 card 1113 number 1029 library 780 catalog 755 cost 733 file 721 book 699 program 672 title 656 computer 655 entry 640 time 631 order 584 datum 536 list 535 field 531 character 515 information 461 table 421 tape 413 code 405 author 397 use 393 word 377 line 343 form 332 data 327 machine 298 name 292 format 286 work 283 processing 277 item 269 element 262 operation 258 document 256 input 242 reference 242 length 241 year 241 problem 241 error 234 method 233 case 226 user 224 t 223 example 222 copy Top 50 proper nouns; "What are the names of persons or places?" -------------------------------------------------------------- 1288 Library 694 MARC 398 Automation 391 _ 384 University 378 Journal 341 Vol 266 IBM 227 LC 226 Congress 198 J 179 Data 177 Information 165 System 163 • 157 June 148 Libraries 146 March 146 II 140 Systems 133 Catalog 130 RECON 129 Department 129 December 128 State 128 S 127 Card 126 New 122 Center 120 Computer 119 Book 112 Processing 109 Research 107 MOLDS 106 t 103 Project 102 File 100 Science 100 National 98 R. 97 J. 97 Fig 96 September 92 C 91 Figure 90 D. 88 L 87 s 87 F 87 C. Top 50 personal pronouns nouns; "To whom are things referred?" ------------------------------------------------------------- 1413 it 779 i 286 they 157 he 114 them 62 we 52 one 48 itself 28 she 21 him 18 themselves 18 me 15 us 14 you 5 de- 4 s 4 em 3 himself 2 tt 2 oneself 2 mine 1 wr 1 v 1 u 1 sz 1 rf 1 pre 1 in- 1 ia 1 i- 1 herself 1 f 1 di- 1 cofoi 1 bl Top 50 lemmatized verbs; "What do things do?" --------------------------------------------- 9842 be 1447 have 865 use 515 do 432 make 360 include 327 give 295 provide 293 produce 289 require 268 contain 262 follow 233 print 217 show 211 add 186 describe 165 appear 164 punch 162 indicate 161 develop 152 base 148 catalog 147 take 145 need 144 select 141 find 131 receive 130 call 127 prepare 125 search 124 set 124 design 124 consider 123 become 122 list 122 begin 121 write 114 process 111 work 111 fix 111 exist 111 convert 107 obtain 103 specify 103 consist 103 assign 101 occur 100 determine 97 generate 94 store Top 50 lemmatized adjectives and adverbs; "How are things described?" --------------------------------------------------------------------- 988 not 499 first 450 more 418 other 389 only 331 new 302 also 263 such 255 subject 244 bibliographic 220 same 218 available 210 then 208 most 207 however 205 well 203 large 201 many 199 library 196 as 193 second 192 possible 187 total 156 readable 155 high 154 main 149 up 148 major 143 very 143 necessary 142 - 141 variable 141 several 139 less 137 so 132 various 131 out 130 special 129 great 122 even 121 few 120 long 118 much 115 single 112 thus 112 manual 109 present 109 initial 109 full 108 original Top 50 lemmatized superlative adjectives; "How are things described to the extreme?" ------------------------------------------------------------------------- 80 most 49 least 37 good 27 high 18 large 16 Most 12 low 11 great 9 long 8 late 5 simple 5 early 4 small 4 cheap 3 old 3 near 3 easy 2 short 2 full 2 big 1 ter 1 slow 1 safe 1 quick 1 narrow 1 heavy 1 gross 1 few 1 fast 1 close 1 bottommost 1 bad Top 50 lemmatized superlative adverbs; "How do things do to the extreme?" ------------------------------------------------------------------------ 128 most 25 least 6 well 4 highest 1 near 1 fast Top 50 Internet domains; "What Webbed places are alluded to in this corpus?" ---------------------------------------------------------------------------- Top 50 URLs; "What is hyperlinked from this corpus?" ---------------------------------------------------- Top 50 email addresses; "Who are you gonna call?" ------------------------------------------------- Top 50 positive assertions; "What sentences are in the shape of noun-verb-noun?" ------------------------------------------------------------------------------- 6 entry is not 4 entry is subject 4 information is available 4 title added entry 3 book does not 3 computer based laboratory 3 computer based system 3 library catalog cards 3 marc based sdi 3 system described above 3 system does not 3 tape is then 2 author had wrong 2 book are poorly 2 book catalog production 2 card is not 2 card is then 2 cards are then 2 cards were only 2 catalog is not 2 catalogs were not 2 characters do not 2 computer based assignments 2 costs are considerably 2 field called jon 2 field called mon 2 fields do not 2 files are not 2 information is not 2 libraries are basically 2 library catalog data 2 library catalog records 2 library catalog searches 2 library does not 2 library had already 2 number is always 2 program called genesis 2 program described here 2 record is also 2 record is then 2 records do not 2 system has many 2 system has not 2 system was originally 2 system was worth 2 systems are really 2 tape is rewound 2 times were re- 1 author added entries 1 author did not Top 50 negative assertions; "What sentences are in the shape of noun-verb-no|not-noun?" --------------------------------------------------------------------------------------- 4 entry is not subject 1 books are not uniform 1 card is not available 1 card is not negligible 1 cards are not available 1 cards were not eligible 1 catalog is no brief 1 catalogs is not easy 1 computer was not brief 1 file is not too 1 files are not absolutely 1 information is not available 1 libraries have no immediate 1 libraries produce no hard 1 library is not entirely 1 order is not necessary 1 orders were not in- 1 program has not yet 1 program is not available 1 program was not officially 1 system is not so 1 systems did not fully 1 table are no problem 1 title are not permanently Sizes of items; "Measures in words, how big is each item?" ---------------------------------------------------------- 17746 2923 11037 4665 8880 5126 8752 4666 8672 4667 8443 5250 8205 5255 8190 5257 7974 2926 7304 2930 6458 5260 6360 2927 6356 3797 6236 4642 6184 5259 6124 5127 5605 2932 5551 5262 5391 2933 5227 5246 5143 3796 5113 5249 4902 5256 4819 2934 4754 2937 4741 2928 4497 5123 4254 2939 4237 2922 4125 5258 4081 5125 4079 4641 3974 2936 3843 4644 3791 4669 3639 2938 3614 2924 3476 2935 3443 5261 3336 5254 3335 5124 3206 2925 3122 5129 3074 3795 3062 5245 2959 3794 2821 5251 2701 2929 2593 5128 2567 4668 2517 4663 2424 5247 2234 2931 2196 5248 1571 3798 1292 5263 487 4643 Readability of items; "How difficult is each item to read?" ----------------------------------------------------------- 98.0 5247 88.0 5126 84.0 2922 84.0 2923 82.0 2928 81.0 2934 81.0 3797 81.0 5261 79.0 2932 79.0 5255 79.0 5259 77.0 2933 77.0 2939 74.0 2936 74.0 5245 73.0 2927 73.0 2929 72.0 2925 72.0 4666 70.0 5250 70.0 5260 69.0 2930 69.0 2938 69.0 3795 69.0 5124 69.0 5128 69.0 5262 68.0 2924 68.0 4667 68.0 5249 68.0 5254 68.0 5257 67.0 2931 67.0 4642 66.0 2937 66.0 3794 65.0 2935 65.0 5258 64.0 4665 64.0 4669 64.0 5125 64.0 5248 63.0 3796 63.0 4668 59.0 4641 59.0 5127 58.0 4643 57.0 3798 57.0 5263 56.0 4644 56.0 5129 55.0 5246 55.0 5256 54.0 5123 52.0 4663 50.0 2926 48.0 5251 Item summaries; "In a narrative form, how can each item be abstracted?" ----------------------------------------------------------------------- 2922 In September, 1966, a system was initiated at the University which provides for the use of automatically produced multiple orders and for the use of change cards to update order information on previously placed automatic production of a weekly books on order list or ''''Processing Information List" similar to that reported by Schultheiss to be in use at the As a first step, the book order request form was redesigned to serve the catalog department for use in the Library of Congress card order RI!CORD OfSCRI,.TION BOOK ORDER CARD # 1 fORIIIAT NO. TIOII CHANGE CARD #1 (PROCESSED 1-lf!EN BOOK RECEIVED ) fORIIIAT NO. Example of Computer Produced Books on Order List "books on order" list by main entry (Figure 6). sent to the catalog department for use as an order for cards from the received will appear on the "books on order" list next to the original 2923 Although the card catalogs were never prepared, the planning was of considerable value later in establishing field and record lengths for the machine record, as well as in securing general agreement on the kind of A complete catalog was to be printed annually (author & title, subject, preparation of the programs, library staff began in July, 1965, the inputting of cataloging information. On the one hand, this has meant a shorter record and greater simplicity in inputting data; on the other hand, it became necessary to maintain a name authority file when the form adopted for the book catalog Added copies and volumes are entered by preparing Area 10 information only and punching S in column 61, the shelf list indicator. filing of entries in the shelf list and in the author & title and subject prints the pages of the author & title and subject catalogs. 2924 fund accounting and the maintenance of an outstanding order file. The Fund Accounting System of the Brown University .Library was initiated on the basis of a program developed in April, 1966. and more effective control over the approximately 150 fund accounts administered by the Order Department of the University Library. a list of items produced by computer from punch cards in which order · financial accounting for book purchases based on pre-punched cards corresponding to purchase orders typed ( 4). The processing file documents, each containing a fund slip, an L.C. order slip, and a cataloger''s work slip on a System is to maintain current balances on the various library fund accounts and to maintain a file of outstandmg orders exclusive of standing This program validates Fund Code, rejects duplicate order numbers and encumbers List Price, thereby reducing Net Available. Record Cards must match an existing order number on file. 2925 different methods of conversion: keypunching, paper-tape typeW1·iting, The record converted included call number, copy number, first 39 letters of the author''s name, were scanned electronically and converted to magnetic tape; Johns Hopkins, working from microfilm copy of the shelf list, used special type font Cost figures for converting library records are scarce. The Johns Hopkins conversion of "about 300,000 · cards" was accomplished by optical scanning and cost $18,170 (3,p.4). available to most research libraries: keypunching, paper-tape typewriting, The shelf list records of the Michigan State University Library were method recorded the bibliographic data by use of an IBM 026 keypunch. In the conversion keypunching cost 6.63 cents per record. Average Cost Per Shelf List Record Converted Costs of Shelf List Conversion/ CHAPIN and PRETZER 73 Costs of Shelf List Conversion/ CHAPIN and PRETZER 73 conversion, at a cost of 6 or 7 cents a record, then the library cannot 2926 This approach is based on the premise that practically all computerbased library systems are in an experimental or research-and-development stage with questionable economic justification, and that it is unnecessary and uneconomical for every library to undertake difficult and done with local funds, home-grown staff, batch processing and even second generation computers, the total systems approach must be based on While the old stage one of library automation was one in which librarians almost exclusively did the design and programming, it is doubtful team of librarians and computer experts should not be able to work effectively together to design and implement future library systems. processing equipment and techniques, and an over-all view of the general state of the library automation art, including its potential and direction of development. effective library automation work has been done by the people who combine the abilities of the systems analyst with those of the expert programmer and are capable of doing a complete project themselves. 2927 Automated systems of book order and circulation control using an IBM Congress card orders, Oakland University invoice-vouchers, a complete weekly: purchase orders, Library of Congress card order slips, and Oakland University invoice-vouchers. Library of Congress order slips, completely computer printed ( including the symbols indicating the reasons for the Library of Congress''s possible inability to supply cards, the account number, and formula for the Then four items of information are keypunched into a "received" card: the purchase order number, the actual A "received" card actuates the following sequence: the order status is changed from "on-order" to "inprocessing", the encumbrance is returned to the book fund, the actual price is charged against the book fund, an Oakland University invoicevoucher is printed and a voucher card punched. When a dealer returns the purchase order card reporting that he cannot supply a book, this information ·is keypunched into the card: the LIB 014 To prepare Library of Congress card orders for new book 2928 Keypunching of an expanded character set for library catalog data is were shelf list cards, the master record at the University of California h 2 1ra"eris;fuon characters 01 library catalog data on more t an 0 anguag , the shell list consisted 01 both printed Library of Congress caIds and cards Ed «: .''ifJ square, which was not used at Santa Cruz, was not counted.fu ~ All data elements were encoded in fixed card fields; that is, the field Date Rec''d 37-40 Month, year receIved at Library Publ./Source 1-60 May be continued on a second card two forms of shelf list cards, as mentioned above: Library of Congres~ "d the programs written to process the catalog card data. Cost Per Title to Produce Machine Readable Catalog Data elements of data within the catalog record than does Santa Cruz. COSTS OF LIBRARY CATALOG CARDS prodUction, but neither library routinely produced cards. 2929 Early in September, 1964, the Yale Medical Library.put into routine operation the Columbia-Harvard-Yale computerized technique for catalog The Columbia Medical Library and Harvard Medical Library also installed IBM 870 Document Writers and tested the programs for card procedure for catalog card production, among them the Medical Library At the University of Kentucky an 870 Document Writer types catalog cards, but new programs were written to run Nearly all reports on catalog card production limit study of costs to ready to be filed into a catalog, cost 16.6 cents per card; cards produced by a machine method consisting of a tape typewriter and a very to drive the 870 Document Writer for production of catalog cards on oneup forms. Table 1 contains cost figures for catalog card production by the four Costs of Library Catalog Cards/ KILGOUR 127 Costs of Library Catalog Cards/ KILGOUR 127 2930 Bell Telephone Laboratories has established an on-line circulation system linking two terminals in each of its three largest libraries to a central of collections, immediate reporting on publication availability or a borrower''s record, automatic reserve follow-up; reduced labor; and increased · reservations and queries with real-time speed and responsiveness; additionally, it provides a wide range of other products and information copies of a title in the library system (union catalogs serve this purpose, after some steps and card handling) but the availability of account number, a five-digit number which is keyed in or read from a prepunched card for all loans, reservations and other transactions requiring Twenty-two different transaction codes are currently available to handle loans, returns, renewals, reservations and queries in real time. the book and the number of copies still available for loan at each library. operation, the requester is automatically charged with the book, the holding library is told his name and address, and mailing follows. 2931 A functioning technical processing system in a two-year community college library utilizes a model 2201 Friden Flexowriter with punch card 1440 computer, with two tape and two disc drives, to produce all acquisitions and catalog files based primarily on a single typing at the time of mechanized order files and shelf lists, catalogs in both the traditional 3x5 6) to provide catalog copy for technical processes to use in producing library use: one Friden Flexowriter (Model 2201) with Card Punch Control Unit and Tab Card Reading Unit, one IBM 026 Key Punch with alternate programming, and guaranteed time on the college-owned IBM The tab cards, containing full order information, are used as the magnetic tape order file is used to prepare tab cards containing all original order information and the cards are sent to the Library with a 2932 Comparative costs obtained were $ .89 per entry for computer assisted catalog This system, depicted in Figure 1, consists of: 1) centralized card production, and; 2) branch catalog maintenance. data and the record number for each Master File entry in alphabetical Book Catalog and periodic cumulative supplements of new entries. The cost figures presented in Table 4 reflect the total card production Manual Catalog Card Production and Maintenance Costs Initial Costs for Machine Readable (Master File) Entries However, the cost per entry figure for computer assisted file maintenance The figures developed in Table 9 establish a cost per entry for file The Locate File update input cost is identical to that used for the entry The Biblio File update costs are the computer keyline production figures presented in Table 6. $0.0242 is the cost per entry for file maintenance for one year. 2933 A system developed to produce fourteen subject reference lists by IBM lists to guide them in the use of library materials, and library reference Such subject reference lists could be easily generated if the library Serial number is assigned to every new entry to maintain the alphabetical order of the complete listing. PAGE 2 REFER!IICE LIBRARY MASTER/fiLE HAIN''miANCE PRE-EDIT REPORT KARCH 0}, 1968 PAGE l REFERENCE LIBRARY MASTER/FILE MAINTENANCE REPORT JANUARY 30, 1968. This list (Figure 7) is an alphabetical summary (in serial number sequence) containing added entries only from the "maintenance" run. The first edition of the reference lists was available for distribution at printed, costs, and sale to students and faculty of first edition lists. Questionnaire on Use of Reference Subject Lists. Subject Reference Lists/ CHEN and KINGHAM 197 Subject Reference Lists/ CHEN and KINGHAM 197 Subject Reference Lists/ CHEN and KINGHAM 197 System Reference Library, File no. 2934 gram produces punched cards which instruct the 870 to type a library Producing library catalog cards and accessions bulletins with the aid preparation of 3x5 catalog cards and the library accessions bulletin. punch, an auxiliary card punch, and a second typewriter; but for an output of library catalog cards and bulletin only the Control Unit and one such as parentheses, brackets, colon and semicolon, are obtained by punching the upper-case shift character in the card immediately before the to a statement which punches the last tracings data and a special character ( ) which, during printout, will cause the typewriter to tum up (Punch statement for main entry on tracing card, drawn from card to the heading, it is punched with the subject code number. The data for the Library Bulletin program consists of the cards which were punched manually for the catalog card program. 2935 readable form the entire shelf list of the Libraries of the State University IBM DATATEXT~ the on-line computer service which was used for the conversion, provided an upperand lowercase typewriter which transmitted data to disk storage of a digital computer. to convert library bibliographic information to machine readable catalog to machine readable form would provide the greatest improvement in library services and operations. experience in using magnetic tape for the handling of bibliographic information, so that when the Library of Congress'' MARC Project begins consoles, data sets and telephone lines were added, and the Conversion Mter three weeks of operation, it was obvious that the typists could tag at the console, thus making Later, typists were trained in several sessions by one of the Library''s typing staff. typed into "working storage" for an hour, inputting 15 to 30 shelf list "Hours Typed" is time that consoles were actually being used to input or correct cards. 2936 A study of problems associated with bibliographic retrieval using unverified input data supplied by requesters. which accepts such data must be designed 1) to increase the level of confidence through machine-generated search structures and variable threshholds and 2) to reduce the dependence upon spelling accuracy, punctuation, spacing and word order. indicates, use of the entire title for searching would present distinct problems for retrieval systems ( 4) . code derived from the compression of author and title information supplied by the user. It is assumed that a similar code is provided for all entries of the data base using the same compression rules for main and added of the significant title words contain less than ten characters. At the time the input data is read, the existence of title, author, edition, words in the supplied title which must be matched by the base code. 2937 Changes in reference services themselves, however, may make automation of question-answering practical. will mean two things to reference librarians: greater concentration of resources, allowing more specialized books and mechanization; and screening of questions at the local level, letting reference centers concentrate The categorizing system was based on two nearly universal generalizati.ons about biographical reference books: 1) They are consistently confined to biographies of persons who have something in common: for example, being alive or dead; or having the same nationality, sex, occupation, religion, race, memberships; or possessing some combination of those as to the specifics contained in each biographical reference book in the SEX, LIVING, NAT (nationality), OCCUP (occupation), MIN ( minorities), DATE, INDEX, SPECl and SPEC2 (specifics). same question ignoring already retrieved books each time, he will eventually have a comprehensive list of possible sources in the data that have A scheme for categorizing biographical reference books by their exclusive and specific categories makes it possible to automatically retrieve 2938 is on codes useful in creating directories to large data files. Recursive Decomposition is a selected letter code devised by the author are Recursive Decomposition Coding or Transition Distance Transition Distance Code: Alphacheck. In Transition Distance Coding, the natural bias of letters and letter distribution of transition distances in English words is essentially uniform." algorithm for Transition Distance Coding but not in Alphacheck. The method of Transition Distance Coding is used to operate on a modulo product of primes associated with transition distances of permuted the transition distances, beginning with 5 so that the alphanumeric base ( 36) and all numbers are relatively prime. When used with a selected letter compression .code, it operates on A process practically identical to that of Transition Distance Coding is (Transition Distance Coding), since this was designed to operate on full Letter Positions and Primes used in Transition Distance Coding 2939 length MARC records containing variable length fields. Congress for the MARC Distribution Service, some potential users of machine readable data have expressed doubts that MARC formatted records tape was in the Library of Congress'' MARC processing format. the analysis of the COBOL language with the task at hand, i.e, the programming effort required to produce the necessary reports from a magnetic tape file of MARC records. Leader Communications Control Fixed Record Variable Library of Congress MARC Processing Format. MARC records at the Library of Congress are contained on magnetic The control fields contain alphameric data elements and are recorded like variable fields although many have a fixed length. field is identified by a 3-byte numeric tag in the record directory In COBOL, the MARC file and records, along with the option card file Output of COBOL Language Program Using MARC II Data. COBOL was successfully used for the computer processing of MARC records. 3794 In designing a coded set of instructions to perform these tasks, it was deemed necessary that any attempt at the selection or compilation of a series of music programs should be made in accordance with certain criteria supplied to the system by the user, and In the final operation of the program, each duplicate title of a work any single work within it, the type of music desired, and additional information, such as date, time and title of the program to be aired and an indication of the station for which the program is to be selected. A program selection card contains the following data: Title of program to be selected, day and time. of titles on the master file the program will tend to select the longest minutes and twelve seconds, MUSPROG selected the programs, updated the catalog, checked for duplication of selections, timed each program, 3795 HIGH SCHOOL LIBRARY DATA PROCESSING Planning and operation of an automated high school library system is using library data processing are the Oak Park-River Forest High School Data processing at Leavenworth High School was made possible by The Leavenworth Data Processing Library System was set up to 1) order all new library books; 2) complete shelf cards and book checkout overdue notices; and 7) provide library inventory, book count lists and The high school leases the following IBM data processing equipment: is punched from lists of books to be ordered and only the following information and columns are punched: author name (columns 14-35), title by data processing students in the high school, working occasionally during class, but mostly after class and on Saturdays on a voluntary basis. When new books are received, the Library staff discards Shelf Card B Cards are pulled by the Library staff and sent to Data Processing,. 3796 In addition, library-type file maintenance, organization and search are The tendency today, however, is to build a single master file with various functional fields where bibliographic information, ordering, and purchasing data, loan records, location information and other item control There are three basic file organization schemes in use today for information retrieval: the serial file, the inverted file and the list process file Although the combined file organization of a serial record file and an inverted index does cost more to maintain than serial or list file organization, it provides such superior search capabilities that it has become the inverted file search, in · its basic form, takes the request descriptors, obtains the list of record addresses or items under each relevant descriptor, bibliographic record on the serially organized main file and the index in File Organization of Library Records/W ARHEIT 29 File Organization of Library Records/W ARHEIT 29 3797 In information retrieval applications, the number of elements may approach several hundred thousand or even a million documents, as in the case of a large library. This immediately poses two serious problems: the storage space necessary to store the matrix increases as the square of the number of documents, and the time required to calculate the matrix also increases quadratically. Tables 3 and 4 show the results of scoring the documents in the sample collection against the profiles from Table 1 (cutoff= 10). Clusters Resulting from Document Scoring the new algorithm, documents are assigned to more than one cluster on loose documents to be assigned to clusters after the first iteration; a document to score higher against profiles of smaller clusters. a document to score higher against profiles of smaller clusters. If the number of documents in the cluster is not too large, number of clusters desired; 2) approximate percentage of loose documents 3798 This book reprints, describes, summarizes or refers to every item in book to look, he can find very useful information, but just leafing through and other complex data bases calls for printing and other output capabilities which exceed those now commonly available with computers. puter produced library publications, such as book and periodical catalogs Some pioneers have already made good use of this advanced technology to produce quality catalogs and lists; this book will second describes many types of computer peripherals and gives considerable attention to the various codes used for computer and printing equipment data input. The objectives of the book as given in the preface are: to offer a description, discussion, critique, and collection of facts and data on coordinate indexing as one of the systems which may be used to intellectually indexing are included so that the material has value for anyone interested The first part of this book describes man''s interaction with recorded 4641 USA STANDARD FOR A FORMAT FOR BIBLIOGRAPHIC INFORMATION INTERCHANGE ON MAGNETIC TAPE National Library of Medicine, and National Agricultural Library Implementation of the Proposed American Standard for a Format for Bibliographic Information Interchange on Magnetic Tape as Applied to Records 0.2 This standard defines a format which is intended for the interchange of bibliographic records on magnetic tape. length (in characters) of the delimiter plus data element identifier) A character used to terminate a variable field within a bibliographic record. A data element which indicates the relation of the bibliographic record to a file (e.g., new, updated, etc.). The entry map is a data element in character positions [Since bibliographic data is usually variable in length, the structure of an entry in the directory will usually follow the length of field, and starting character position) other data entry consisted of a 3-digit tag, no length of field data 4642 The record terminator will consist of the "group separator" (ASCII character AGRICULTURAL LIBRARY IMPLEMENTATION OF THE PROPOSED AMERICAN STANDARD FOR A FORMAT FOR BIBLIOGRAPHIC INFORMATION INTERCHANGE ON MAGNETIC The data element identifier will consist of one basic character. A library character set for the roman alphabet and romanized non-roman alphabets necessitates a larger number of The correlation of the character set to digital form code The basic digital form code for the character set is The indicator is a two-character code consisting of basic characters specifying the origin or authority for the data in each variable field. Each directory entry consists of the following data elements : Each directory entry consists of the following data elements : CONTENT This is the classification code of the subject CONTENT This is the classification code of the subject This is the classification code of the provisional subject term in data element 420 4643 AND SUBRECORD RELATIONSHIP FIELDS IN THE PROPOSED A subrecord directory will be used when a bibliographic record consists of more than one subrecord. 1.1 Entries in Subrecord Directory The format of each entry in the subrecord directory is as for multiple subrecords describing bibliographic units which have the same type-of-record 1.1.1.2 Type-of-Record and Bibliographic Level When the subrecord does not describe a primary bibliographic unit, the type-of-record A subrecord relationship field will be present if, and only if, a subrecord directory is present. The subrecord relationship field, when The subrecord relationship field, when present, shall contain fixed fields which are used to indicate the relationships of subrecords to each other. The format of each relationship field is as shown : 64 subrecords with the same type-of-record and bibliographic The tags in the subrecord relationship field have the three ( 3) basic characters used to indicate the relationship between subfields. 4644 MARC Manuals Used by the Library of Congress prepared by the Information Systems Office, Library of Congress. or MARC II data for experiment or operations, complete information on MARC Manuals is an indispensable publication for any library contemplating use of, or using, MARC II tapes. Library Association needed hardly a month to produce the MARC Manuals. Bureau of Information Sciences Research, Graduate School of Library Service, Rutgers, The State University. reporting them, 6) comprehensive reviews, 7) bibliographies, 8) evaluative articles, and 9) directories to current research. well as bibliographies or literature surveys of fields outside Information These criteria, and the book''s subject classification scheme, are themselves useful and they reflect considerable thought, even though the user A bibliography on information science and technology Coward prepared these documents for users of BNB MARC records. These two publications contain much useful information about MARC Library & Information Science Abstracts. 4663 the Library of Congress and the British National Bibliography the opportunity to look outwards at other systems being developed in other countries. The traditional method of distributing Library of Congress bibliographic information is to provide catalogue cards or proof sheets. The original MARC project ( 1) was "an experiment to test the feasability of distributing Library of Congress cataloguing in machine readable of the MARC I project; its end product was a catalogue record on magnetic tape. form of presentation, as best exemplified by a Library of Congress catalogue card, has been developed which holds all the essential data and What is less understandable is that the Library of Congress should transfer the non-standard cataloguing rules established by provide for international exchange a standard MARC record together with American MARC Projects not as systems for distributing bibliographic information but as the forerunners of an international bibliographic network. 4665 This paper deals with the application to library systems development programs of planning techniques which long ago proved their usefulness in the requirements of a major library systems development program and Similarly, plans should include consideration of every major activity required to achieve the objectives, but the level of detail may vary widely A library administration begins the process of developing objectives for a modernization program by reviewing existing library Special, single-end-item projects, like facilities development, objectives or policy formulation, major systems projects (i.e. the development of a new module), etc., are a part of the management apparatus of the system. Library Systems Development Program Work Breakdown Structure. When a single set of concepts is finally selected, estimates of development and operating costs for a new module based on the concept, together with its projected effectiveness, should be made and compared with 4666 The cited report provided some information about the cost of computerized library catalogs. spent on cataloging adds cost to the study that tends to reduce the number of libraries willing to report; those that do report may or may not cataloging operation, three sources of information were used: 1) a detailed analysis made as part of an overall time and motion study of operations in the Lockheed Research Library ( 3); a detailed study of the cataloging and processing activities of the New York Public Library as a preliminary to possible automation of some of these operations ( 4); and a detailed study of the acquisitions, cataloging, and other processing operations for the Columbia University science libraries ( 5) . The simplest conclusion to reach from a study of Table 2 is that cataloging costs vary widely from one library to another. 4667 3) A content edit program generates records in the final processing format. At this stage, mnemonic tags are converted to numeric form, subfield codes may be supplied, implicit fixed fields are set, etc. purpose of this article is to give a progress report on work in three significant areas : 1) automatic tagging of data elements by format recognition programs; 2) file analysis by a statistical program called GENESIS; format are recorded in binary, packed decimal, or EBCDIC notation depending on the characteristics of the data and the processing required. Specifies the MARC record data element( s) characters in the data element of the MARC record being The personal name main entry field (tag 100) of each MARC record The data base would be searched for records with LC card number field it can query any data base that contains records in the MARC processing 4668 Its acceptability as a substitute for conventional catalog searching methods is questioned unless recall performance can be improved, either by use of the and 98.67% accuracy (precision) in a test of his method in which unverified book order requests were matched against a MARC I data base method when applied to bibliographic retrieval from a library catalog. method by manual simulation, and to test its recall performance in real paper came from 126 searches in which the catalog users had been successful in locating the specific documents that they were seeking. instance to the author-title search clues supplied by the catalog user and Ruecking''s method was designed for use with English-language titles catalog, only 88 were retrieved by the compression-code method-a recall were retrieved by the compression-code method-a recall rate of 73%. The test considered real document searches that were concluded successfully in an actual library using a manual catalog; recall is 4669 for the development and implementation of a Connecticut Library Research Center (see ERIC Document ED 0221512). Thesaurus of ERIC Descriptors, 2d Edition, Washington, D.C.: Educational Resources Information Center, Bureau of Research, Office of Education, 1969. The Thesaurus of ERIC Descriptors is, by its own admission, an authority list ("Only those descriptors actually used for indexing are placed controlled vocabulary indexing/ retrieval; an authority list may be used provide a good discussion of the problems of indexing and "thesaurus" development, especially concerning the need for multi-term entries. Guide to Reprints provides an annual, cumulative list of books, journals titles listed in Announced Reprints. gives the date of the first time a reprint title is listed, eventually librarians edition containing 8880 titles held by 35 libraries was published in the with less information as to title changes than in this list); the use of 5123 of investigation that will yield findings useful in development of mechanized, descriptive cataloging. be possible to prepare entries mechanically from title pages for inclusion who obtains a novel by use of an author-title entry in a catalog is not For the library user the words in an author''s name, a title, and subject headings, are index labels that he uses to find a book that contains information he needs. retrieved from a library by use of a library catalog is via an author-title The organization for a computerized catalog containing entries prepared mechanically from title pages would be somewhat different from be carried out to produce knowledge needed for development of a computerized library catalog hospitable to descriptive cataloging entries produced mechanically: 1) use of library catalogs; 2) specification for derived keys; 3) identification of title-page words useful and not useful for occurence on classical catalog entries, title pages, and in subject indexes, 5124 Use of an IBM 1401 computer and a single keypunch operation for changing a college book collection from Dewey decimal to Library of Congress Decisions were made to automate a projected reclassification, the acquisitions and circulation systems and accounting procedures, and to produce lists of the serial holdings ( 4). The IBM Processing installation used for the Library comprised the following: 1401 computer, 12K storage; 1402 Card Reader Punch; 1403 create the circulation book card, duplicating accession number, call number, author and title. A machine readable shelf list made possible an automated circulation The marked circulation cards are sent three times a week to the Data Center, where the Old and new circulation cards are machine filed together by accession number and returned When the book and its Library of Congress cards have been received, The shelf list is kept in the IBM card form, and a book catalog could 5125 A computer based laboratory for library science students to use in class MARC file ( 48,POO records, approx.) were used for search and retrieval Data bases, programs, and seven different class assignments are described and evaluated for their impact on library education in general and individual students and faculty in particular. the MARC Pilot Project tapes as the file of catalog records, has been the computer programs and the MARC tapes could be used by library school faculty in class assignments and by students for independent research. deal with the various class assignments and student projects developed The facilities at LEEP include MARC Pilot Project data bases, computer programs, and personnel. Class Assignments and Student Projects Using LEEP three titles; search the MARCS/DPS file for the assigned number; compare titles cataloged with titles some students the only experience with LEEP was a class assignment Program Description: MARC I Record Sort" (NAPS 00881); 5) "LEEP 5126 A centralized data base of MARC II records distributed by the Library This report describes the cooperative system developed by the Department of Libraries to maintain and make available MARC II records to maintaining a MARC master file of all records sent out by the Library selected records by LC card number from the MARC master file for copies specified LC card number records from the MARC master file to Inputs are 1) a MARC master (a tape in MARC format and code containing all records merged to date), which is in LC card number sequence; and 2) MARC "items" (a tape ( s) in MARC format and code a 12-position LC card number and a code indicating if this MARC record This program prints all LC card numbers on any tape in MARC code MARC code and format, its output a printed listing of all Library of using MARC records from the State data base for bibliographic information for its machine produced book catalog. 5127 of catalog cards, book catalogs, periodical check-in, serials holdings, circulation control systems, acquisitions programs and searching of files, or Systems librarians have been busy designing individual programs, building special computer stored files, implementing holding program; the development of authority lists in book catalog programs; the simultaneous updating of accession files and circulation control It is this total systems concept which is the new and current development of library Electronic Data Processing ( EDP). One of the basic elements of this hardware was the development of real-time, on-line, terminal-oriented, timeshared systems. were quick to take advantage of their in-house, time-share system to implement acquisitions, catalog input and library bulletin programs (9). With the advent of time-shared, on-line capabilities and the potentiality of building total, integrated systems, librarians today who are planning EDP systems are faced with a number of design decisions: 1) Should 5128 position, and relative record address ( RRA) in this paper, refers to the computed offset address of a record in the file, relative to the starting address of the file is track 40, a record with RRA = 5 would have its home Direct Access Methods, the system locates that disk address generated file was being occupied by records-this method became unusable; records were then located so far from their original addresses that terminal of the input data, map records uniformly across the available file space. The algorithm which resulted utilizes a pseudo-random number generator, RAND (7) developed at the W.S.U. Computing Center RANDL, N to the purchase order number of the record to be located, and by using Inputs to this program are: a) an algorithm to generate relative record locations; b) a there are six records to map into a direct access file of eight locations. 5129 It attempts to give a good introduction to current practices in library automation and a fairly detailed ac4 Kimber present computerized systems for various library activities: Chapter III, "Ordering and Acquisitions," Chapter IV, "Circulation Control," The second issue of the "International Directory of Research and Development Scientists" ( IDR&DS) iists the names and organizational addresses of 152,648 authors whose papers were listed in either 4o