mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-authorsLatin-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/28676.txt inflating: ./tmp/input/input-file/3234.txt inflating: ./tmp/input/input-file/2811.txt inflating: ./tmp/input/input-file/2812.txt inflating: ./tmp/input/input-file/8945.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-authorsLatin-gutenberg FILE: cache/2812.txt OUTPUT: txt/2812.txt FILE: cache/3234.txt OUTPUT: txt/3234.txt FILE: cache/28676.txt OUTPUT: txt/28676.txt FILE: cache/2811.txt OUTPUT: txt/2811.txt FILE: cache/8945.txt OUTPUT: txt/8945.txt === file2bib.sh === id: 3234 author: Pliny, the Younger title: The Letters of the Younger Pliny, First Series — Volume 1 date: pages: extension: .txt txt: ./txt/3234.txt cache: ./cache/3234.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'3234.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' === file2bib.sh === id: 2812 author: Cicero, Marcus Tullius title: Letters of Marcus Tullius Cicero date: pages: extension: .txt txt: ./txt/2812.txt cache: ./cache/2812.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'2812.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 3234 txt/../ent/3234.ent 2812 txt/../wrd/2812.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 2812 txt/../ent/2812.ent 3234 txt/../pos/3234.pos 2812 txt/../pos/2812.pos 3234 txt/../wrd/3234.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 2811 txt/../wrd/2811.wrd 2811 txt/../pos/2811.pos 2811 txt/../ent/2811.ent 28676 txt/../wrd/28676.wrd 28676 txt/../pos/28676.pos 8945 txt/../pos/8945.pos 8945 txt/../wrd/8945.wrd 8945 txt/../ent/8945.ent 28676 txt/../ent/28676.ent === file2bib.sh === id: 2811 author: Pliny, the Younger title: Letters of Pliny date: pages: extension: .txt txt: ./txt/2811.txt cache: ./cache/2811.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 5 resourceName b'2811.txt' === file2bib.sh === id: 8945 author: Trollope, Anthony title: The Life of Cicero, Volume One date: pages: extension: .txt txt: ./txt/8945.txt cache: ./cache/8945.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 11 resourceName b'8945.txt' === file2bib.sh === id: 28676 author: Trollope, Anthony title: The Life of Cicero, Volume II. date: pages: extension: .txt txt: ./txt/28676.txt cache: ./cache/28676.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 7 resourceName b'28676.txt' Done mapping. Reducing subject-authorsLatin-gutenberg === reduce.pl bib === id = 28676 author = Trollope, Anthony title = The Life of Cicero, Volume II. date = pages = extension = .txt mime = text/plain words = 127673 sentences = 7883 flesch = 82 summary = To Cicero's thinking, both Pompey and Cæsar were certain letter which Cicero had written to Cæsar. In the spring of the year we find Cicero writing to Cæsar in apparently the day of danger came, he joined Pompey's army against Cæsar, doubting, Then comes the passage in his letter on the strength of which Mr. Forsyth has condemned Cicero, not without abstract truth in his told, indeed, by Mr. Froude that the man was Cæsar, and that Cicero Cæsar's control--because we know that on his return Cicero's villas were mind of Cicero the idea of saying words which Cæsar might receive with The two men, Cæsar and Cicero, had agreed to differ, and had talked of have no means of knowing; but we feel that Cicero was not a man likely Not long after Cæsar's death Cicero left Rome, and spent the ensuing declared that he, Cicero, had been the author of Cæsar's death, in order cache = ./cache/28676.txt txt = ./txt/28676.txt === reduce.pl bib === id = 2811 author = Pliny, the Younger title = Letters of Pliny date = pages = extension = .txt mime = text/plain words = 92338 sentences = 5013 flesch = 74 summary = given by Metius Modestus, an excellent man, at that time in banishment is a man of sound judgment and great sagacity formed upon long this great man have done honour to the emperor, to the age, and to the had been a new day, he studied till supper-time, when a book was again it deserves a place rather in public history than in a private letter; senate in the usual manner, and as fully as the time and place would upon the nomination-day of proper persons to be received into the sacred 16 (return) [ "The equestrian dignity, or that order of the Roman people 105 (return) [ An officer employed by the emperor to receive and or for other reasons of the same kind, I thought proper, Sir, knowing return for many good offices he had done the city. 1053 (return) [ The Roman provinces in the times of the emperors were of cache = ./cache/2811.txt txt = ./txt/2811.txt === reduce.pl bib === id = 8945 author = Trollope, Anthony title = The Life of Cicero, Volume One date = pages = extension = .txt mime = text/plain words = 123251 sentences = 6615 flesch = 77 summary = Cicero's death men had to doubt whether literature or the Republic had familiar; but in Cicero's time the male free inhabitants of Rome did probably at work on his great poem, Cicero wrote an account of his Pompey the Great, was then Consul (B.C. 89), and Cicero was sent out to wonder how such a man as Cicero found time for the real work of his Verres had carried on his plunder during the years 73, 72, 71 B.C. During this time Cicero had been engaged sedulously as an advocate in state of things now in London, nor was it at Rome in Cicero's time. None of Cicero's letters have come to us from the year of his little was known in Rome of Cæsar till the time of Catiline's Catiline, had been declared in the Senate by Cicero himself on that day Cæsar's right-hand man in Gaul, was of the same politics as Cicero--so cache = ./cache/8945.txt txt = ./txt/8945.txt === reduce.pl bib === === reduce.pl bib === Building ./etc/reader.txt 28676 8945 2811 2811 8945 28676 number of items: 5 sum of words: 343,262 average size in words: 114,420 average readability score: 77 nouns: man; time; men; life; people; return; day; nothing; words; law; year; years; letter; power; way; letters; death; others; friend; work; speech; city; world; part; things; money; friends; name; truth; nature; place; one; country; days; story; character; order; matter; province; side; house; purpose; opinion; something; case; idea; mind; hands; father; brother verbs: was; had; be; is; have; been; were; are; has; made; do; did; know; come; think; done; having; am; says; said; see; being; make; say; written; given; seems; taken; told; found; give; go; trajan; does; read; put; let; tells; came; find; called; thought; take; left; become; brought; known; went; sent; used adjectives: own; other; great; such; same; roman; many; good; first; much; little; old; more; public; last; young; certain; true; necessary; few; best; full; second; very; whole; present; able; political; different; better; new; general; former; next; high; proper; large; private; common; several; strong; anxious; worthy; short; least; possible; open; greater; special; noble adverbs: not; so; then; only; now; as; very; even; more; well; up; too; also; out; most; again; still; much; here; however; indeed; never; no; on; down; therefore; there; probably; yet; far; always; back; perhaps; thus; ever; rather; already; together; doubt; almost; just; away; altogether; first; all; afterward; once; hardly; enough; generally pronouns: he; his; it; i; him; you; we; they; their; my; them; himself; me; us; your; our; her; its; she; themselves; myself; itself; yourself; one; ourselves; herself; mine; yours; ours; theirs; thyself; thy; hers; yourselves; you"--as; what!--do; verrem--"the; oneself; none--"the; non; grævius[138; behalf"--the proper nouns: cicero; cæsar; .; rome; pompey; ii; senate; lib; antony; i.; republic; atticus; de; catiline; brutus; clodius; _; sulla; consul; emperor; romans; m.; pliny; b.c.; ad; cato; milo; consulship; ca; consuls; roman; quintus; et; crassus; ibid; verres; sir; piso; italy; mr.; att; prætor; marius; hortensius; pro; state; gaul; augustus; marcellus; mommsen keywords: rome; roman; cicero; senate; republic; quintus; pompey; italy; gaul; cæsar; crassus; consul; clodius; cato; catiline; brutus; b.c.; atticus; antony; verre; tribune; trajan; time; sulla; state; sir; sidenote; sicily; secundus; sallust; roscius; return; regulus; quæstor; public; prætor; pliny; place; piso; person; nero; mr.; mommsen; milo; marius; marcellus; letter; ibid; hortensius; greek one topic; one dimension: cicero file(s): ./cache/28676.txt titles(s): The Life of Cicero, Volume II. three topics; one dimension: cicero; return; professors file(s): ./cache/28676.txt, ./cache/2811.txt, titles(s): The Life of Cicero, Volume II. | Letters of Pliny | Letters of Marcus Tullius Cicero five topics; three dimensions: cicero cæsar man; time return trajan; continence fetters impregnated; continence fetters impregnated; continence fetters impregnated file(s): ./cache/28676.txt, ./cache/2811.txt, , , titles(s): The Life of Cicero, Volume II. | Letters of Pliny | Letters of Marcus Tullius Cicero | Letters of Marcus Tullius Cicero | Letters of Marcus Tullius Cicero Type: gutenberg title: subject-authorsLatin-gutenberg date: 2021-06-01 time: 13:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Authors, Latin" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 2812 author: Cicero, Marcus Tullius title: Letters of Marcus Tullius Cicero date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 3234 author: Pliny, the Younger title: The Letters of the Younger Pliny, First Series — Volume 1 date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 2811 author: Pliny, the Younger title: Letters of Pliny date: words: 92338.0 sentences: 5013.0 pages: flesch: 74.0 cache: ./cache/2811.txt txt: ./txt/2811.txt summary: given by Metius Modestus, an excellent man, at that time in banishment is a man of sound judgment and great sagacity formed upon long this great man have done honour to the emperor, to the age, and to the had been a new day, he studied till supper-time, when a book was again it deserves a place rather in public history than in a private letter; senate in the usual manner, and as fully as the time and place would upon the nomination-day of proper persons to be received into the sacred 16 (return) [ "The equestrian dignity, or that order of the Roman people 105 (return) [ An officer employed by the emperor to receive and or for other reasons of the same kind, I thought proper, Sir, knowing return for many good offices he had done the city. 1053 (return) [ The Roman provinces in the times of the emperors were of id: 28676 author: Trollope, Anthony title: The Life of Cicero, Volume II. date: words: 127673.0 sentences: 7883.0 pages: flesch: 82.0 cache: ./cache/28676.txt txt: ./txt/28676.txt summary: To Cicero''s thinking, both Pompey and Cæsar were certain letter which Cicero had written to Cæsar. In the spring of the year we find Cicero writing to Cæsar in apparently the day of danger came, he joined Pompey''s army against Cæsar, doubting, Then comes the passage in his letter on the strength of which Mr. Forsyth has condemned Cicero, not without abstract truth in his told, indeed, by Mr. Froude that the man was Cæsar, and that Cicero Cæsar''s control--because we know that on his return Cicero''s villas were mind of Cicero the idea of saying words which Cæsar might receive with The two men, Cæsar and Cicero, had agreed to differ, and had talked of have no means of knowing; but we feel that Cicero was not a man likely Not long after Cæsar''s death Cicero left Rome, and spent the ensuing declared that he, Cicero, had been the author of Cæsar''s death, in order id: 8945 author: Trollope, Anthony title: The Life of Cicero, Volume One date: words: 123251.0 sentences: 6615.0 pages: flesch: 77.0 cache: ./cache/8945.txt txt: ./txt/8945.txt summary: Cicero''s death men had to doubt whether literature or the Republic had familiar; but in Cicero''s time the male free inhabitants of Rome did probably at work on his great poem, Cicero wrote an account of his Pompey the Great, was then Consul (B.C. 89), and Cicero was sent out to wonder how such a man as Cicero found time for the real work of his Verres had carried on his plunder during the years 73, 72, 71 B.C. During this time Cicero had been engaged sedulously as an advocate in state of things now in London, nor was it at Rome in Cicero''s time. None of Cicero''s letters have come to us from the year of his little was known in Rome of Cæsar till the time of Catiline''s Catiline, had been declared in the Senate by Cicero himself on that day Cæsar''s right-hand man in Gaul, was of the same politics as Cicero--so ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel