mv: 'input-file.zip' and './input-file.zip' are the same file
Creating study carrel named subject-proverbs-freebo
Initializing database
Unzipping
Archive: input-file.zip
  inflating: ./tmp/input/A44738.xml
  inflating: ./tmp/input/A58161.xml
  inflating: ./tmp/input/A03057.xml
  inflating: ./tmp/input/A47620.xml
  inflating: ./tmp/input/A17848.xml
  inflating: ./tmp/input/xml2htm.xsl
  inflating: ./tmp/input/A16738.xml
  inflating: ./tmp/input/metadata.csv
  inflating: ./tmp/input/A16737.xml
  inflating: ./tmp/input/A85437.xml
  inflating: ./tmp/input/A15606.xml
caution: excluded filename not matched: *MACOSX*
=== DIRECTORIES: ./tmp/input
=== DIRECTORY:
=== metadata file: ./tmp/input/metadata.csv
=== found metadata file
=== updating bibliographic database
Building study carrel named subject-proverbs-freebo
May 24, 2021 8:19:32 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
WARNING: J2KImageReader not loaded. JPEG2000 files will not be processed. See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io for optional dependencies.
May 24, 2021 8:19:32 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
WARNING: Tesseract OCR is installed and will be automatically applied to image files unless you've excluded the TesseractOCRParser from the default parser. Tesseract may dramatically slow down content extraction (TIKA-2359). As of Tika 1.15 (and prior versions), Tesseract is automatically called. In future versions of Tika, users may need to turn the TesseractOCRParser on via TikaConfig.
May 24, 2021 8:19:32 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem
WARNING: org.xerial's sqlite-jdbc is not loaded. Please provide the jar on your classpath to parse sqlite files. See tika-parsers/pom.xml for the correct version.
INFO Starting Apache Tika 1.24.1 server
INFO Setting the server's publish address to be http://localhost:9998/
INFO Logging initialized @2045ms to org.eclipse.jetty.util.log.Slf4jLog
INFO jetty-9.4.27.v20200227; built: 2020-02-27T18:37:21.340Z; git: a304fd9f351f337e7c0e2a7c28878dd536149c6c; jvm 1.8.0_281-b09
INFO Started ServerConnector@3e74829{HTTP/1.1, (http/1.1)}{localhost:9998}
INFO Started @2153ms
WARN Empty contextPath
INFO Started o.e.j.s.h.ContextHandler@70f02c32{/,null,AVAILABLE}
INFO Started Apache Tika server at http://localhost:9998/
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
FILE: cache/A85437.xml OUTPUT: txt/A85437.txt
FILE: cache/A16738.xml OUTPUT: txt/A16738.txt
FILE: cache/A16737.xml OUTPUT: txt/A16737.txt
FILE: cache/A03057.xml OUTPUT: txt/A03057.txt
FILE: cache/A15606.xml OUTPUT: txt/A15606.txt
FILE: cache/A47620.xml OUTPUT: txt/A47620.txt
FILE: cache/A17848.xml OUTPUT: txt/A17848.txt
FILE: cache/A58161.xml OUTPUT: txt/A58161.txt
FILE: cache/A44738.xml OUTPUT: txt/A44738.txt
=== file2bib.sh ===
INFO Detecting media type for Filename: b'A16737.xml'
INFO Detecting media type for Filename: b'A85437.xml'
INFO Detecting media type for Filename: b'A03057.xml'
INFO Detecting media type for Filename: b'A16738.xml'
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO Detecting media type for Filename: b'A15606.xml'
INFO Detecting media type for Filename: b'A47620.xml'
INFO Detecting media type for Filename: b'A17848.xml'
INFO Detecting media type for Filename: b'A58161.xml'
INFO rmeta/text (autodetecting type)
INFO Detecting media type for Filename: b'A44738.xml'
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
INFO rmeta/text (autodetecting type)
A85437 txt/../pos/A85437.pos
A85437 txt/../wrd/A85437.wrd
A16737 txt/../wrd/A16737.wrd
A85437 txt/../ent/A85437.ent
A16737 txt/../pos/A16737.pos
A16738 txt/../pos/A16738.pos
=== file2bib.sh ===
id: A85437
author: Goodwin, Thomas, 1600-1680.
title: Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. 1679/80.
date: 1680
pages:
extension: .xml
txt: ./txt/A85437.txt
cache: ./cache/A85437.xml
Content-Type application/xml
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 4
resourceName b'A85437.xml'
A16738 txt/../wrd/A16738.wrd
A16737 txt/../ent/A16737.ent
=== file2bib.sh ===
id: A16738
author: Breton, Nicholas, 1545?-1626?
title: Crossing of proverbs The second part. With, Certaine briefe questions and answeres. By B.N. Gent.
date: 1616
pages:
extension: .xml
txt: ./txt/A16738.txt
cache: ./cache/A16738.xml
Content-Type application/xml
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 15
resourceName b'A16738.xml'
=== file2bib.sh ===
id: A16737
author: Breton, Nicholas, 1545?-1626?
title: Crossing of prouerbs Crosse-answeres. and crosse-humours. By B.N. Gent.
date: 1616 pages: extension: .xml txt: ./txt/A16737.txt cache: ./cache/A16737.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 7 resourceName b'A16737.xml' A03057 txt/../pos/A03057.pos A16738 txt/../ent/A16738.ent A03057 txt/../wrd/A03057.wrd === file2bib.sh === id: A03057 author: Herbert, George, 1593-1633. title: Outlandish proverbs, selected by Mr. G.H. date: 1640 pages: extension: .xml txt: ./txt/A03057.txt cache: ./cache/A03057.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 36 resourceName b'A03057.xml' A15606 txt/../pos/A15606.pos A03057 txt/../ent/A03057.ent A47620 txt/../pos/A47620.pos A15606 txt/../wrd/A15606.wrd A17848 txt/../pos/A17848.pos A58161 txt/../pos/A58161.pos A47620 txt/../wrd/A47620.wrd A15606 txt/../ent/A15606.ent A47620 txt/../ent/A47620.ent A58161 txt/../wrd/A58161.wrd A17848 txt/../ent/A17848.ent A17848 txt/../wrd/A17848.wrd A44738 txt/../pos/A44738.pos === file2bib.sh === id: A15606 author: Herbert, George, 1592-1637. title: Wits recreations. Selected from the finest fancies of moderne muses date: 1640 pages: extension: .xml txt: ./txt/A15606.txt cache: ./cache/A15606.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 141 resourceName b'A15606.xml' A58161 txt/../ent/A58161.ent A44738 txt/../wrd/A44738.wrd A44738 txt/../ent/A44738.ent === file2bib.sh === id: A47620 author: Leigh, Edward, 1602-1671. title: Select and choyce observations, containing all the Romane emperours the first eighteen by Edward Leigh ... 
; the others added by his son Henry Leigh ... ; certain choyce French proverbs, alphabetically disposed and Englished added also by the same Edward Leigh. date: 1657 pages: extension: .xml txt: ./txt/A47620.txt cache: ./cache/A47620.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 172 resourceName b'A47620.xml' === file2bib.sh === id: A58161 author: Ray, John, 1627-1705. title: A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. Ray, M.A. and Fellow of the Royal Society. date: 1678 pages: extension: .xml txt: ./txt/A58161.txt cache: ./cache/A58161.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 192 resourceName b'A58161.xml' === file2bib.sh === id: A17848 author: Camden, William, 1551-1623. title: Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes date: 1605 pages: extension: .xml txt: ./txt/A17848.txt cache: ./cache/A17848.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 188 resourceName b'A17848.xml' === file2bib.sh === id: A44738 author: Howell, James, 1594?-1666. 
title: Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ... / collected by J.H., Esqr. date: 1659 pages: extension: .xml txt: ./txt/A44738.txt cache: ./cache/A44738.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 235 resourceName b'A44738.xml' Done mapping. Reducing subject-proverbs-freebo === reduce.pl bib === id = A47620 author = Leigh, Edward, 1602-1671. title = Select and choyce observations, containing all the Romane emperours the first eighteen by Edward Leigh ... ; the others added by his son Henry Leigh ... ; certain choyce French proverbs, alphabetically disposed and Englished added also by the same Edward Leigh. date = 1657 pages = extension = .xml mime = application/xml words = 66381 sentences = 20685 flesch = 88 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period. 
cache = ./cache/A47620.xml txt = ./txt/A47620.txt === reduce.pl bib === id = A17848 author = Camden, William, 1551-1623. title = Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes date = 1605 pages = extension = .xml mime = application/xml words = 86944 sentences = 28233 flesch = 92 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). cache = ./cache/A17848.xml txt = ./txt/A17848.txt === reduce.pl bib === id = A58161 author = Ray, John, 1627-1705. title = A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. Ray, M.A. and Fellow of the Royal Society. 
date = 1678 pages = extension = .xml mime = application/xml words = 85352 sentences = 27249 flesch = 102 summary = A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period. cache = ./cache/A58161.xml txt = ./txt/A58161.txt === reduce.pl bib === id = A03057 author = Herbert, George, 1593-1633. title = Outlandish proverbs, selected by Mr. G.H. date = 1640 pages = extension = .xml mime = application/xml words = 10412 sentences = 3957 flesch = 107 summary = This text is an enriched version of the TCP digital transcription A03057 of text S103991 in the English Short Title Catalog (STC 13182). Textual changes and metadata enrichments aim at making the text more computationally tractable, easier to read, and suitable for network-based collaborative curation by amateur and professional end users from many walks of life. Textual changes aim at restoring the text the author or stationer meant to publish. 69 KB of XML-encoded text transcribed from 39 1-bit group-IV TIFF page images. 
This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Transcribed from: (Early English Books Online ; image set 4180) Images scanned from microfilm: (Early English books, 1475-1640 ; 890:02) P[aine] for Humphrey Blunden; at the Castle in Corn-hill, Proverbs, English -Early works to 1800. Text and markup reviewed and edited cache = ./cache/A03057.xml txt = ./txt/A03057.txt === reduce.pl bib === id = A85437 author = Goodwin, Thomas, 1600-1680. title = Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. 1679/80. date = 1680 pages = extension = .xml mime = application/xml words = 1598 sentences = 311 flesch = 89 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). cache = ./cache/A85437.xml txt = ./txt/A85437.txt === reduce.pl bib === id = A16737 author = Breton, Nicholas, 1545?-1626? title = Crossing of prouerbs Crosse-answeres. 
and crosse-humours. By B.N. Gent. date = 1616 pages = extension = .xml mime = application/xml words = 2136 sentences = 542 flesch = 96 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Eld] for Iohn Wright, and are to be solde at his shop without Newgate, at the signe of the Bible, EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). The general aim of EEBO-TCP is to encode one copy (usually the first edition) of every monographic English-language title published between 1473 and 1700 available in EEBO. EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). cache = ./cache/A16737.xml txt = ./txt/A16737.txt === reduce.pl bib === id = A16738 author = Breton, Nicholas, 1545?-1626? title = Crossing of proverbs The second part. With, Certaine briefe questions and answeres. By B.N. Gent. date = 1616 pages = extension = .xml mime = application/xml words = 3979 sentences = 1309 flesch = 101 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. 
Eld] for Iohn Wright, and are to be solde at his shop without Newgate, at the signe of the Bible, EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period. cache = ./cache/A16738.xml txt = ./txt/A16738.txt === reduce.pl bib === id = A44738 author = Howell, James, 1594?-1666. title = Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ... / collected by J.H., Esqr. date = 1659 pages = extension = .xml mime = application/xml words = 151063 sentences = 50728 flesch = 102 summary = Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ... Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ... EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). 
EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). cache = ./cache/A44738.xml txt = ./txt/A44738.txt === reduce.pl bib === id = A15606 author = Herbert, George, 1592-1637. title = Wits recreations. Selected from the finest fancies of moderne muses date = 1640 pages = extension = .xml mime = application/xml words = 42343 sentences = 14971 flesch = 105 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period. 
cache = ./cache/A15606.xml
txt = ./txt/A15606.txt
Building ./etc/reader.txt
A44738 A17848 A58161 A85437 A58161 A47620
number of items: 9
sum of words: 450,208
average size in words: 50,023
average readability score: 98
nouns: man; men; time; name; day; death; nothing; viz; wife; hath; names; horse; house; t; life; head; words; thing; things; none; world; one; love; hand; way; woman; water; others; son; friend; doth; fire; money; hee; wine; word; self; place; fool; king; people; women; heart; body; dog; year; tongue; night; ones; reason
verbs: is; be; was; have; are; had; were; ''s; make; do; made; hath; come; makes; did; being; say; take; let; see; called; go; said; give; know; comes; put; goes; came; been; found; done; set; taken; keep; eat; am; having; given; love; live; find; call; lost; gave; speak; tell; thought; leave; spoken
adjectives: good; great; many; old; other; little; better; more; own; much; such; first; ill; same; best; wise; bad; long; full; most; young; rich; true; last; fair; dead; worth; poor; new; saith; common; small; high; whole; french; worse; short; english; sweet; second; few; happy; white; cold; non; noble; blind; greatest; hard; black
adverbs: not; then; so; never; as; well; more; out; too; now; also; most; up; that; is; long; there; away; very; much; here; better; ever; first; onely; soon; yet; still; down; thus; once; rather; in; often; therefore; off; far; enough; all; again; no; ni; together; before; over; commonly; sometimes; nt; ill; forth
pronouns: he; his; it; i; him; you; they; their; them; my; her; me; your; thy; we; she; our; himself; thee; us; themselves; one; its; mine; ni; je; ay; na; ''s; theirs; yours; ts; ours; ne; au; ye; y; ha; ''em; yn; ya; whereof; vp; il; à; herself; yee; wŷl; wr; whosoever
proper nouns: 〉; ◊; 〈; la; de; y; le; que; c.; il; ●; el; god; thou; un; est; che; ni; king; hath; ne; se; i.; chi; si; nid; e.; qui; non; al; di; english; ital; ei; quien; è; england; bien; yn; les; proverb; mas; p.; del; tu; lord; da; à; emperour; hee
keywords: tcp; good; god; time; king; great; man; like; hath; english; court; church; wife; sea; proverb; little; ill; hee; french; est; england; word; woman; thy; thou; thing; sun; son; saint; rome; prince; paris; old; north; non; nation; love; lord; long; law; latine; lady; italians; gentleman; france; fool; empire; emperour; doth; devil
one topic; one dimension: good
file(s): ./cache/A17848.xml
titles(s): Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes
three topics; one dimension: hee; la; good
file(s): ./cache/A17848.xml, ./cache/A44738.xml, ./cache/A58161.xml
titles(s): Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes | Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ... / collected by J.H., Esqr. | A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. Ray, M.A. and Fellow of the Royal Society.
five topics; three dimensions: la il que; good man hath; king names hee; 1680 feb 1679; damnable tossed glorified
file(s): ./cache/A44738.xml, ./cache/A58161.xml, ./cache/A17848.xml, ./cache/A85437.xml, ./cache/A85437.xml
titles(s): Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ... / collected by J.H., Esqr. | A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. Ray, M.A. and Fellow of the Royal Society. | Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes | Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. 1679/80. | Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. 1679/80.
Type: zip2carrel
title: subject-proverbs-freebo
date: 2021-05-24
time: 20:17
username: emorgan
patron: Eric Morgan
email: emorgan@nd.edu
input: input-file.zip
==== make-pages.sh htm files
==== make-pages.sh complex files
==== make-pages.sh named enities
==== making bibliographics
id: A16737
author: Breton, Nicholas, 1545?-1626?
title: Crossing of prouerbs Crosse-answeres. and crosse-humours. By B.N. Gent.
date: 1616
words: 2136
sentences: 542
pages:
flesch: 96
cache: ./cache/A16737.xml
txt: ./txt/A16737.txt
summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Eld] for Iohn Wright, and are to be solde at his shop without Newgate, at the signe of the Bible, EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com).
The general aim of EEBO-TCP is to encode one copy (usually the first edition) of every monographic English-language title published between 1473 and 1700 available in EEBO. EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). id: A16738 author: Breton, Nicholas, 1545?-1626? title: Crossing of proverbs The second part. With, Certaine briefe questions and answeres. By B.N. Gent. date: 1616 words: 3979 sentences: 1309 pages: flesch: 101 cache: ./cache/A16738.xml txt: ./txt/A16738.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Eld] for Iohn Wright, and are to be solde at his shop without Newgate, at the signe of the Bible, EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period. id: A17848 author: Camden, William, 1551-1623. 
title: Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes date: 1605 words: 86944 sentences: 28233 pages: flesch: 92 cache: ./cache/A17848.xml txt: ./txt/A17848.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes Remaines of a greater worke, concerning Britaine, the inhabitants thereof, their languages, names, surnames, empreses, wise speeches, poësies, and epitaphes EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). id: A85437 author: Goodwin, Thomas, 1600-1680. title: Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. 1679/80. date: 1680 words: 1598 sentences: 311 pages: flesch: 89 cache: ./cache/A85437.xml txt: ./txt/A85437.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. 
Most holy and profitable sayings of that reverend divine, Doctor Tho. Goodwin Who departed this life, Feb. 23. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). id: A15606 author: Herbert, George, 1592-1637. title: Wits recreations. Selected from the finest fancies of moderne muses date: 1640 words: 42343 sentences: 14971 pages: flesch: 105 cache: ./cache/A15606.xml txt: ./txt/A15606.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period. id: A03057 author: Herbert, George, 1593-1633. title: Outlandish proverbs, selected by Mr. G.H. 
date: 1640
words: 10412
sentences: 3957
pages:
flesch: 107
cache: ./cache/A03057.xml
txt: ./txt/A03057.txt
summary: This text is an enriched version of the TCP digital transcription A03057 of text S103991 in the English Short Title Catalog (STC 13182). Textual changes and metadata enrichments aim at making the text more computationally tractable, easier to read, and suitable for network-based collaborative curation by amateur and professional end users from many walks of life. Textual changes aim at restoring the text the author or stationer meant to publish. 69 KB of XML-encoded text transcribed from 39 1-bit group-IV TIFF page images. This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. Transcribed from: (Early English Books Online ; image set 4180) Images scanned from microfilm: (Early English books, 1475-1640 ; 890:02) P[aine] for Humphrey Blunden; at the Castle in Corn-hill, Proverbs, English -Early works to 1800. Text and markup reviewed and edited

id: A44738
author: Howell, James, 1594?-1666.
title: Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ... / collected by J.H., Esqr.
date: 1659
words: 151063
sentences: 50728
pages:
flesch: 102
cache: ./cache/A44738.xml
txt: ./txt/A44738.txt
summary: Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ... Paroimiographia Proverbs, or, Old sayed savves & adages in English (or the Saxon toung), Italian, French, and Spanish, whereunto the British for their great antiquity and weight are added ...
EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org).

id: A47620
author: Leigh, Edward, 1602-1671.
title: Select and choyce observations, containing all the Romane emperours the first eighteen by Edward Leigh ... ; the others added by his son Henry Leigh ... ; certain choyce French proverbs, alphabetically disposed and Englished added also by the same Edward Leigh.
date: 1657
words: 66381
sentences: 20685
pages:
flesch: 88
cache: ./cache/A47620.xml
txt: ./txt/A47620.txt
summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period.

id: A58161
author: Ray, John, 1627-1705.
title: A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. Ray, M.A. and Fellow of the Royal Society.
date: 1678
words: 85352
sentences: 27249
pages:
flesch: 102
cache: ./cache/A58161.xml
txt: ./txt/A58161.txt
summary: A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. A collection of English proverbs digested into a convenient method for the speedy finding any one upon occasion : with short annotations : whereunto are added local proverbs with their explications, old proverbial rhythmes, less known or exotick proverbial sentences, and Scottish proverbs / by J. EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period.

==== make-pages.sh questions
==== make-pages.sh search
==== make-pages.sh topic modeling corpus
Zipping study carrel