mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-antoniusMarcusBc-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/1130.txt inflating: ./tmp/input/input-file/2062.txt inflating: ./tmp/input/input-file/1796.txt inflating: ./tmp/input/input-file/2268.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-antoniusMarcusBc-gutenberg FILE: cache/1130.txt OUTPUT: txt/1130.txt FILE: cache/2062.txt OUTPUT: txt/2062.txt FILE: cache/2268.txt OUTPUT: txt/2268.txt FILE: cache/1796.txt OUTPUT: txt/1796.txt 1130 txt/../ent/1130.ent === file2bib.sh === id: 2268 author: Shakespeare, William title: Antony and Cleopatra date: pages: extension: .txt txt: ./txt/2268.txt cache: ./cache/2268.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'2268.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 2268 txt/../ent/2268.ent 2268 txt/../pos/2268.pos 1796 txt/../ent/1796.ent 1130 txt/../pos/1130.pos 1796 txt/../pos/1796.pos 1796 txt/../wrd/1796.wrd 1130 txt/../wrd/1130.wrd 2268 txt/../wrd/2268.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point === file2bib.sh === id: 1796 author: Shakespeare, William title: Antony and Cleopatra date: pages: extension: .txt txt: ./txt/1796.txt cache: ./cache/1796.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'1796.txt' === file2bib.sh === id: 1130 author: Shakespeare, William title: The Tragedy of Antony and Cleopatra date: pages: extension: .txt txt: ./txt/1130.txt cache: ./cache/1130.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'1130.txt' 2062 txt/../pos/2062.pos 2062 txt/../wrd/2062.wrd 2062 txt/../ent/2062.ent === file2bib.sh === id: 2062 author: Dryden, John title: All for Love; Or, The World Well Lost: A Tragedy date: pages: extension: .txt txt: ./txt/2062.txt cache: ./cache/2062.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 3 resourceName b'2062.txt' Done mapping. Reducing subject-antoniusMarcusBc-gutenberg === reduce.pl bib === id = 2062 author = Dryden, John title = All for Love; Or, The World Well Lost: A Tragedy date = pages = extension = .txt mime = text/plain words = 29409 sentences = 3440 flesch = 93 summary = to that which I reserved for Antony and Cleopatra; whose mutual love That gave the world a lord: 'tis Antony's. A love, which knows no bounds, to Antony, My emperor; the man I love next Heaven: Thou long'st to curse me, and I give thee leave. And I will leave her; though, Heaven knows, I love Caesar shall know what 'tis to force a lover Ere Caesar saw your eyes, you gave me love, To say it was designed: 'tis true, I loved you, Gods, 'tis too much; too much for man to bear. I love this man, who runs to meet his ruin; How thou upbraid'st my love: The queen has eyes, And when thou speak'st (but let it first be long), Has loved her long; he, next my god-like lord, Think not 'tis thou hast conquered Antony; Then art thou innocent, my poor dear love, Thou hast loved me, cache = ./cache/2062.txt txt = ./txt/2062.txt === reduce.pl bib === id = 1796 author = Shakespeare, William title = Antony and Cleopatra date = pages = extension = .txt mime = text/plain words = 43 sentences = 14 flesch = 93 summary = THIS EBOOK WAS ONE OF PROJECT GUTENBERG'S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AT EBOOK #100. THE HTML FILE AT: http://www.gutenberg.org/files/100/100-h/100-h.htm cache = ./cache/1796.txt txt = ./txt/1796.txt === reduce.pl bib === === reduce.pl bib === id = 1130 author = Shakespeare, William title = The Tragedy of Antony and Cleopatra date = pages = extension = .txt mime = text/plain words = 40 sentences = 10 flesch = 88 summary = THIS EBOOK WAS ONE OF PROJECT GUTENBERG'S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#100) at https://www.gutenberg.org/ebooks/100 cache = ./cache/1130.txt txt = ./txt/1130.txt Building ./etc/reader.txt 2062 2268 1796 2062 2268 1796 number of items: 4 sum of words: 29,492 average size in words: 9,830 average readability score: 91 nouns: cleopatra; love; man; death; eyes; soul; life; world; lord; friend; queen; men; nature; word; time; serapion; heart; hand; fortune; power; part; none; fate; way; honour; gods; face; art; words; day; arms; reason; name; exit; wife; poets; place; emperor; work; sight; peace; virtue; mistress; friends; ventidius; slave; people; nothing; hours; farewell verbs: have; is; be; are; was; has; ''s; were; let; had; see; do; been; take; make; am; think; know; speak; come; give; leave; go; die; lost; enter; say; made; left; loved; love; ''m; hear; tell; find; did; bear; live; done; comes; saw; loves; look; said; gone; took; sent; bring; fear; thought adjectives: more; own; other; last; such; true; great; good; false; poor; first; roman; best; much; little; young; noble; dear; same; plain; happy; many; dead; sure; better; private; old; long; worth; nous; next; new; most; honest; hard; full; weak; vain; sweet; soft; pleased; open; mighty; least; kind; greatest; greater; fair; wretched; scarce adverbs: not; so; too; now; then; yet; more; out; up; octavia; never; first; well; tis; still; only; here; back; down; no; again; just; far; thus; once; most; indeed; even; much; aside; therefore; long; hence; ever; as; there; else; off; away; all; somewhat; rather; perhaps; better; on; already; alone; often; after; wholly pronouns: i; you; my; he; me; it; his; your; him; her; she; they; their; we; them; our; us; myself; himself; thy; its; thee; yours; yourself; themselves; mine; herself; ourselves; thyself; theirs; ours; is''t; yourselves; thou; perish--; one; itself; hers; ''s proper nouns: antony; ventidius; thou; dolabella; caesar; cleopatra; octavia; charmion; heaven; iras; lord; alexas; egypt; twas; tis; o''er; madam; roman; hast; dryden; god; serapion; ere; rome; romans; et; octavius; actium; wouldst; methinks; exeunt; ye; maecenas; england; canst; y; virgil; stood; sir; shakespeare; pr''ythee; ne; myris; lov''st; la; isis; horace; fortune; e''er; art keywords: ebook; ventidius; serapion; octavia; love; iras; heaven; dolabella; cleopatra; charmion; caesar; antony; alexas one topic; one dimension: antony file(s): ./cache/1130.txt titles(s): The Tragedy of Antony and Cleopatra three topics; one dimension: antony; 100; ebooks file(s): ./cache/2062.txt, ./cache/1796.txt, titles(s): All for Love; Or, The World Well Lost: A Tragedy | Antony and Cleopatra | Antony and Cleopatra five topics; three dimensions: antony ventidius cleopatra; 100 ebook gutenberg; ebooks https file; ebooks https file; ebooks https file file(s): ./cache/2062.txt, ./cache/1796.txt, , , titles(s): All for Love; Or, The World Well Lost: A Tragedy | Antony and Cleopatra | Antony and Cleopatra | Antony and Cleopatra | Antony and Cleopatra Type: gutenberg title: subject-antoniusMarcusBc-gutenberg date: 2021-05-31 time: 16:05 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Antonius, Marcus, 83?-30 B.C." ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 2062 author: Dryden, John title: All for Love; Or, The World Well Lost: A Tragedy date: words: 29409.0 sentences: 3440.0 pages: flesch: 93.0 cache: ./cache/2062.txt txt: ./txt/2062.txt summary: to that which I reserved for Antony and Cleopatra; whose mutual love That gave the world a lord: ''tis Antony''s. A love, which knows no bounds, to Antony, My emperor; the man I love next Heaven: Thou long''st to curse me, and I give thee leave. And I will leave her; though, Heaven knows, I love Caesar shall know what ''tis to force a lover Ere Caesar saw your eyes, you gave me love, To say it was designed: ''tis true, I loved you, Gods, ''tis too much; too much for man to bear. I love this man, who runs to meet his ruin; How thou upbraid''st my love: The queen has eyes, And when thou speak''st (but let it first be long), Has loved her long; he, next my god-like lord, Think not ''tis thou hast conquered Antony; Then art thou innocent, my poor dear love, Thou hast loved me, id: 1130 author: Shakespeare, William title: The Tragedy of Antony and Cleopatra date: words: 40.0 sentences: 10.0 pages: flesch: 88.0 cache: ./cache/1130.txt txt: ./txt/1130.txt summary: THIS EBOOK WAS ONE OF PROJECT GUTENBERG''S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#100) at https://www.gutenberg.org/ebooks/100 id: 1796 author: Shakespeare, William title: Antony and Cleopatra date: words: 43.0 sentences: 14.0 pages: flesch: 93.0 cache: ./cache/1796.txt txt: ./txt/1796.txt summary: THIS EBOOK WAS ONE OF PROJECT GUTENBERG''S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AT EBOOK #100. THE HTML FILE AT: http://www.gutenberg.org/files/100/100-h/100-h.htm id: 2268 author: Shakespeare, William title: Antony and Cleopatra date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel