mv: 'input-file.zip' and './input-file.zip' are the same file Creating study carrel named subject-wisdom-freebo Initializing database Unzipping Archive: input-file.zip inflating: ./tmp/input/xml2htm.xsl inflating: ./tmp/input/A62741.xml inflating: ./tmp/input/metadata.csv inflating: ./tmp/input/A02588.xml inflating: ./tmp/input/A67762.xml inflating: ./tmp/input/A27386.xml caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: === metadata file: ./tmp/input/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-wisdom-freebo May 25, 2021 1:00:39 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: J2KImageReader not loaded. JPEG2000 files will not be processed. See https://pdfbox.apache.org/2.0/dependencies.html#jai-image-io for optional dependencies. May 25, 2021 1:00:39 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: Tesseract OCR is installed and will be automatically applied to image files unless you've excluded the TesseractOCRParser from the default parser. Tesseract may dramatically slow down content extraction (TIKA-2359). As of Tika 1.15 (and prior versions), Tesseract is automatically called. In future versions of Tika, users may need to turn the TesseractOCRParser on via TikaConfig. May 25, 2021 1:00:39 PM org.apache.tika.config.InitializableProblemHandler$3 handleInitializableProblem WARNING: org.xerial's sqlite-jdbc is not loaded. Please provide the jar on your classpath to parse sqlite files. See tika-parsers/pom.xml for the correct version. INFO Starting Apache Tika 1.24.1 server INFO Setting the server's publish address to be http://localhost:9998/ INFO Logging initialized @4139ms to org.eclipse.jetty.util.log.Slf4jLog INFO jetty-9.4.27.v20200227; built: 2020-02-27T18:37:21.340Z; git: a304fd9f351f337e7c0e2a7c28878dd536149c6c; jvm 1.8.0_281-b09 INFO Started ServerConnector@3e74829{HTTP/1.1, (http/1.1)}{localhost:9998} INFO Started @4259ms WARN Empty contextPath INFO Started o.e.j.s.h.ContextHandler@51fadaff{/,null,AVAILABLE} INFO Started Apache Tika server at http://localhost:9998/ INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) FILE: cache/A27386.xml OUTPUT: txt/A27386.txt FILE: cache/A62741.xml OUTPUT: txt/A62741.txt FILE: cache/A67762.xml OUTPUT: txt/A67762.txt FILE: cache/A02588.xml OUTPUT: txt/A02588.txt === file2bib.sh === INFO Detecting media type for Filename: b'A27386.xml' INFO Detecting media type for Filename: b'A62741.xml' INFO Detecting media type for Filename: b'A67762.xml' INFO Detecting media type for Filename: b'A02588.xml' INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) INFO rmeta/text (autodetecting type) A27386 txt/../pos/A27386.pos A27386 txt/../wrd/A27386.wrd A62741 txt/../pos/A62741.pos A27386 txt/../ent/A27386.ent A67762 txt/../pos/A67762.pos A62741 txt/../wrd/A62741.wrd A62741 txt/../ent/A62741.ent A67762 txt/../wrd/A67762.wrd A67762 txt/../ent/A67762.ent A02588 txt/../pos/A02588.pos === file2bib.sh === id: A62741 author: Tanner, Thomas, 1630-1682. title: [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. In their late western circuit. By Tho. Tanner, Rector of Brightstone in Hants. date: 1677 pages: extension: .xml txt: ./txt/A62741.txt cache: ./cache/A62741.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 86 resourceName b'A62741.xml' === file2bib.sh === id: A27386 author: Benlowes, Edward, 1603?-1676. title: The summary of vvisedome by Edward Benlowes, Esq. date: 1657 pages: extension: .xml txt: ./txt/A27386.txt cache: ./cache/A27386.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 67 resourceName b'A27386.xml' === file2bib.sh === id: A67762 author: Younge, Richard. title: No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... / by R. Younge ... date: 1666 pages: extension: .xml txt: ./txt/A67762.txt cache: ./cache/A67762.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 91 resourceName b'A67762.xml' A02588 txt/../wrd/A02588.wrd A02588 txt/../ent/A02588.ent === file2bib.sh === id: A02588 author: Hall, Joseph, 1574-1656. title: Salomons diuine arts, of 1. Ethickes, 2. Politickes, 3. Oeconomicks that is; the gouernment of 1. Behauiour, 2. Common-vvealth, 3. Familie. Drawne into method, out of his Prouerbs & Ecclesiastes. With an open and plaine paraphrase, vpon the Song of songs. By Ioseph Hall. date: 1609 pages: extension: .xml txt: ./txt/A02588.txt cache: ./cache/A02588.xml Content-Type application/xml X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.xml.DcXMLParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 112 resourceName b'A02588.xml' Done mapping. Reducing subject-wisdom-freebo === reduce.pl bib === id = A02588 author = Hall, Joseph, 1574-1656. title = Salomons diuine arts, of 1. Ethickes, 2. Politickes, 3. Oeconomicks that is; the gouernment of 1. Behauiour, 2. Common-vvealth, 3. Familie. Drawne into method, out of his Prouerbs & Ecclesiastes. With an open and plaine paraphrase, vpon the Song of songs. By Ioseph Hall. date = 1609 pages = extension = .xml mime = application/xml words = 32312 sentences = 11360 flesch = 99 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. "An open and plaine paraphrase vpon the Song of songs" has separate dated title page and pagination; register is continuous. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period. cache = ./cache/A02588.xml txt = ./txt/A02588.txt === reduce.pl bib === id = A27386 author = Benlowes, Edward, 1603?-1676. title = The summary of vvisedome by Edward Benlowes, Esq. date = 1657 pages = extension = .xml mime = application/xml words = 5800 sentences = 2124 flesch = 88 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. The summary of vvisedome by Edward Benlowes, Esq. The summary of vvisedome by Edward Benlowes, Esq. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). The texts were encoded and linked to page images in accordance with level 4 of the TEI in Libraries guidelines. cache = ./cache/A27386.xml txt = ./txt/A27386.txt === reduce.pl bib === id = A67762 author = Younge, Richard. title = No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... / by R. Younge ... date = 1666 pages = extension = .xml mime = application/xml words = 15768 sentences = 4722 flesch = 99 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). cache = ./cache/A67762.xml txt = ./txt/A67762.txt === reduce.pl bib === id = A62741 author = Tanner, Thomas, 1630-1682. title = [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. In their late western circuit. By Tho. Tanner, Rector of Brightstone in Hants. date = 1677 pages = extension = .xml mime = application/xml words = 10189 sentences = 2807 flesch = 92 summary = This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. printed for William Keblewhite bookseller at Newport in the Isle of Wight, EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). cache = ./cache/A62741.xml txt = ./txt/A62741.txt Building ./etc/reader.txt A02588 A67762 A62741 A67762 A62741 A27386 number of items: 4 sum of words: 64,069 average size in words: 16,017 average readability score: 94 nouns: man; pr; men; wisdom; knowledge; heart; things; life; thy; world; hee; religion; others; eyes; time; grace; way; hand; soule; mouth; glory; reason; soul; selfe; none; nothing; text; house; end; art; words; hands; euill; light; truth; people; spirit; part; foole; faith; death; word; head; riches; one; hath; earth; day; ▪; thing verbs: is; be; are; have; was; had; were; let; do; know; come; been; take; see; make; bee; made; did; am; being; hath; set; say; found; said; haue; according; put; makes; vnto; thought; brought; knows; hee; done; blessed; find; encoded; thou; give; think; speak; get; fall; saving; gone; go; giue; came; keep adjectives: wise; good; other; such; own; great; much; more; true; many; wicked; full; better; first; little; rich; righteous; best; same; haue; false; least; foolish; most; sweet; right; able; whole; pleasant; holy; early; faithfull; spiritual; natural; humane; greatest; christian; pure; godly; glorious; saith; new; last; humble; greater; few; english; common; doth; precious adverbs: not; so; then; more; therefore; now; yet; only; out; never; much; most; forth; as; also; ever; onely; too; away; still; there; thus; thereof; first; indeed; again; rather; in; even; alone; far; well; no; before; here; up; together; off; lastly; very; comely; once; better; all; over; hence; especially; down; vs; usually pronouns: his; it; he; i; my; they; their; him; thy; them; her; me; we; you; our; thee; she; us; himself; your; themselves; mine; its; one; vp; theirs; ours; ye; yee; whereof; u; thou; quae; nay; jt; hee proper nouns: god; thou; ec; lord; yea; 〉; hath; ◊; 〈; haue; christ; bee; hee; tcp; ●; loue; thee; owne; pr; mee; church; thine; c.; world; ye; vp; gods; king; sauiour; doe; cor; father; text; shee; joh; art; tei; sect; prov; hell; eebo; english; thy; princes; est; spirit; salomons; oxford; ioy; wisdom keywords: tcp; man; god; thy; lord; world; thou; thee; tei; sauiour; salomons; religion; prov; pride; philosophy; munde; lust; like; king; joh; hell; hee; haue; gods; father; est; edward; early; cor; church; christian; christ one topic; one dimension: pr file(s): ./cache/A62741.xml titles(s): [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. In their late western circuit. By Tho. Tanner, Rector of Brightstone in Hants. three topics; one dimension: pr; knowledge; religion file(s): ./cache/A02588.xml, ./cache/A67762.xml, ./cache/A62741.xml titles(s): Salomons diuine arts, of 1. Ethickes, 2. Politickes, 3. Oeconomicks that is; the gouernment of 1. Behauiour, 2. Common-vvealth, 3. Familie. Drawne into method, out of his Prouerbs & Ecclesiastes. With an open and plaine paraphrase, vpon the Song of songs. By Ioseph Hall. | No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... / by R. Younge ... | [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. In their late western circuit. By Tho. Tanner, Rector of Brightstone in Hants. five topics; three dimensions: pr thy shall; knowledge wisdom god; religion wisdom men; plea typically secure; plea typically secure file(s): ./cache/A02588.xml, ./cache/A67762.xml, ./cache/A62741.xml, ./cache/A62741.xml, ./cache/A62741.xml titles(s): Salomons diuine arts, of 1. Ethickes, 2. Politickes, 3. Oeconomicks that is; the gouernment of 1. Behauiour, 2. Common-vvealth, 3. Familie. Drawne into method, out of his Prouerbs & Ecclesiastes. With an open and plaine paraphrase, vpon the Song of songs. By Ioseph Hall. | No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... / by R. Younge ... | [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. In their late western circuit. By Tho. Tanner, Rector of Brightstone in Hants. | [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. In their late western circuit. By Tho. Tanner, Rector of Brightstone in Hants. | [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. In their late western circuit. By Tho. Tanner, Rector of Brightstone in Hants. Type: zip2carrel title: subject-wisdom-freebo date: 2021-05-25 time: 12:44 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: input-file.zip ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: A27386 author: Benlowes, Edward, 1603?-1676. title: The summary of vvisedome by Edward Benlowes, Esq. date: 1657 words: 5800 sentences: 2124 pages: flesch: 88 cache: ./cache/A27386.xml txt: ./txt/A27386.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. The summary of vvisedome by Edward Benlowes, Esq. The summary of vvisedome by Edward Benlowes, Esq. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). The texts were encoded and linked to page images in accordance with level 4 of the TEI in Libraries guidelines. id: A02588 author: Hall, Joseph, 1574-1656. title: Salomons diuine arts, of 1. Ethickes, 2. Politickes, 3. Oeconomicks that is; the gouernment of 1. Behauiour, 2. Common-vvealth, 3. Familie. Drawne into method, out of his Prouerbs & Ecclesiastes. With an open and plaine paraphrase, vpon the Song of songs. By Ioseph Hall. date: 1609 words: 32312 sentences: 11360 pages: flesch: 99 cache: ./cache/A02588.xml txt: ./txt/A02588.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. "An open and plaine paraphrase vpon the Song of songs" has separate dated title page and pagination; register is continuous. EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). Selection was intended to range over a wide variety of subject areas, to reflect the true nature of the print record of the period. id: A62741 author: Tanner, Thomas, 1630-1682. title: [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. In their late western circuit. By Tho. Tanner, Rector of Brightstone in Hants. date: 1677 words: 10189 sentences: 2807 pages: flesch: 92 cache: ./cache/A62741.xml txt: ./txt/A62741.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. [Hebrew] or Wisdome and prudence exhibited in a sermon before the right honourable the Lord Chief Justice Rainsford, and the Lord Chief Justice North. printed for William Keblewhite bookseller at Newport in the Isle of Wight, EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). id: A67762 author: Younge, Richard. title: No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... / by R. Younge ... date: 1666 words: 15768 sentences: 4722 pages: flesch: 99 cache: ./cache/A67762.xml txt: ./txt/A67762.txt summary: This keyboarded and encoded edition of the work described above is co-owned by the institutions providing financial support to the Early English Books Online Text Creation Partnership. No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... No wicked man a wise man, true wisdom described the excellency of spiritual, experimental, and saving knowledge, above all humane wisdom and learning ... EEBO-TCP is a partnership between the Universities of Michigan and Oxford and the publisher ProQuest to create accurately transcribed and encoded texts based on the image sets published by ProQuest via their Early English Books Online (EEBO) database (http://eebo.chadwyck.com). EEBO-TCP aimed to produce large quantities of textual data within the usual project restraints of time and funding, and therefore chose to create diplomatic transcriptions (as opposed to critical editions) with light-touch, mainly structural encoding based on the Text Encoding Initiative (http://www.tei-c.org). ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel