mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-humorousStoriesCanadian-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/20633.txt inflating: ./tmp/input/input-file/4682.txt inflating: ./tmp/input/input-file/6340.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-humorousStoriesCanadian-gutenberg FILE: cache/20633.txt OUTPUT: txt/20633.txt FILE: cache/4682.txt OUTPUT: txt/4682.txt FILE: cache/6340.txt OUTPUT: txt/6340.txt === file2bib.sh === id: 4682 author: Leacock, Stephen title: Nonsense Novels date: pages: extension: .txt txt: ./txt/4682.txt cache: ./cache/4682.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'4682.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 4682 txt/../ent/4682.ent 4682 txt/../pos/4682.pos 4682 txt/../wrd/4682.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 20633 txt/../pos/20633.pos 6340 txt/../pos/6340.pos 20633 txt/../wrd/20633.wrd 6340 txt/../wrd/6340.wrd 6340 txt/../ent/6340.ent 20633 txt/../ent/20633.ent === file2bib.sh === id: 20633 author: Leacock, Stephen title: Winsome Winnie and other New Nonsense Novels date: pages: extension: .txt txt: ./txt/20633.txt cache: ./cache/20633.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 3 resourceName b'20633.txt' === file2bib.sh === id: 6340 author: Leacock, Stephen title: Literary Lapses date: pages: extension: .txt txt: ./txt/6340.txt cache: ./cache/6340.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'6340.txt' Done mapping. Reducing subject-humorousStoriesCanadian-gutenberg === reduce.pl bib === id = 20633 author = Leacock, Stephen title = Winsome Winnie and other New Nonsense Novels date = pages = extension = .txt mime = text/plain words = 39749 sentences = 3563 flesch = 89 summary = "Miss Winnifred," said the Old Lawyer, looking keenly over and through "Sir," said Winnifred, drawing herself up proudly, "let me pass, I "Oh, sir," said Winnifred, clasping her hands and falling on her knees "Then, sir," said Winnifred, rising from her chair, "let me say this. "Miss Clair," said the Lawyer, advancing and taking the girl's hand for "I knew it all the time," said Lord Mordaunt, drawing the girl to his "Father," she said, "he wants to take our little girl away. "Miss Elphinspoon," he said, "I think I know what is coming. "Have a cigar, Chief," said Kent, "and let me hear what the trouble is." "Stop a bit," said Kent, pausing to think a moment. "Ha," said Kent, "a looloo!" The two men looked into one another's eyes. "Now tell me," said Kent, as they stood beside the billiard table, "what "No," said Kent, taking her hand a moment, "you were not." cache = ./cache/20633.txt txt = ./txt/20633.txt === reduce.pl bib === id = 6340 author = Leacock, Stephen title = Literary Lapses date = pages = extension = .txt mime = text/plain words = 42932 sentences = 3243 flesch = 86 summary = "Girl," said the earl sternly, "I care not for the man's trained by long years of high living and plain thinking, "Say good night!" they said, "why it's only half-past One night I heard one man say, "Well, let's call up New You know, many a man realizes late in life that if when not like to think of your pretty little letters lying old days a man was turned out thoroughly equipped after "Ah, statistics" said the other; "wonderful things, sir, But the Quick Man on the front seat said in a big whisper This time the thing seemed like a little round box. The great man is certainly a wonderful thing. old man put his hand on Smith's head and say, mark his Times were bad with the old man. "And you know nothing of death, of course?" said the poet "Pardon," said the old man. cache = ./cache/6340.txt txt = ./txt/6340.txt === reduce.pl bib === Building ./etc/reader.txt 20633 6340 4682 20633 6340 4682 number of items: 3 sum of words: 82,681 average size in words: 41,340 average readability score: 87 nouns: man; time; girl; room; moment; life; thing; day; way; hand; something; face; night; years; house; head; evening; nothing; father; book; men; things; mind; course; days; table; side; one; morning; hour; eye; question; place; people; heart; times; feet; eyes; sir; door; anything; water; money; hands; boy; work; part; player; idea; humour verbs: was; is; had; said; have; be; do; are; has; were; been; ''s; know; did; see; get; think; say; let; am; go; take; come; made; came; asked; tell; went; got; put; seemed; found; make; looked; cried; knew; want; find; read; answered; sat; ''ve; turned; thought; seen; look; give; done; told; stood adjectives: little; other; old; great; good; young; few; more; first; full; whole; such; last; same; long; own; many; much; right; new; least; best; quiet; half; very; present; dear; certain; beautiful; next; cold; poor; happy; afraid; white; short; dead; bright; equal; wonderful; quick; high; unhappy; tall; simple; second; blue; big; ready; large adverbs: not; n''t; up; then; so; now; out; as; very; just; never; only; here; too; down; all; there; again; on; still; once; more; back; even; in; well; ever; off; always; most; away; quite; over; together; yet; however; far; first; soon; really; about; thus; much; right; home; at; simply; perhaps; long; almost pronouns: i; it; he; his; you; my; him; me; her; they; we; she; your; them; our; their; its; us; himself; myself; itself; one; themselves; herself; yourself; mine; ourselves; ''s; yours; oneself; ours; hers; ''em; you''ll; theirs; saloonio--; q.--what; iii.--you; i''m; hay; etc.--one proper nouns: _; mr.; john; kent; winnifred; sir; leacock; miss; throgton; kelly; croyden; lord; de; new; smith; saloonio; randolph; jones; general; c; england; clair; scalper; inspector; vaux; thornton; oxhead; edwin; elphinspoon; earl; chapter; buggam; york; wynchgate; robinson; grange; eggleston; edith; peter; house; lee; angela; fifty; father; edwards; city; wazoos; mother; marchioness; literary keywords: mr.; winnifred; vaux; time; throgton; thing; smith; sir; scalper; saloonio; robinson; randolph; oxhead; miss; man; lord; little; like; life; let; leacock; kent; kelly; jones; john; inspector; gwendoline; general; elphinspoon; edwin; croyden; colonel; clair; chapter one topic; one dimension: said file(s): ./cache/20633.txt titles(s): Winsome Winnie and other New Nonsense Novels three topics; one dimension: said; said; sunk file(s): ./cache/6340.txt, ./cache/20633.txt, titles(s): Literary Lapses | Winsome Winnie and other New Nonsense Novels | Nonsense Novels five topics; three dimensions: said man time; said mr john; exploded bits orders; exploded bits orders; exploded bits orders file(s): ./cache/6340.txt, ./cache/20633.txt, , , titles(s): Literary Lapses | Winsome Winnie and other New Nonsense Novels | Nonsense Novels | Nonsense Novels | Nonsense Novels Type: gutenberg title: subject-humorousStoriesCanadian-gutenberg date: 2021-06-06 time: 17:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Humorous stories, Canadian" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 20633 author: Leacock, Stephen title: Winsome Winnie and other New Nonsense Novels date: words: 39749.0 sentences: 3563.0 pages: flesch: 89.0 cache: ./cache/20633.txt txt: ./txt/20633.txt summary: "Miss Winnifred," said the Old Lawyer, looking keenly over and through "Sir," said Winnifred, drawing herself up proudly, "let me pass, I "Oh, sir," said Winnifred, clasping her hands and falling on her knees "Then, sir," said Winnifred, rising from her chair, "let me say this. "Miss Clair," said the Lawyer, advancing and taking the girl''s hand for "I knew it all the time," said Lord Mordaunt, drawing the girl to his "Father," she said, "he wants to take our little girl away. "Miss Elphinspoon," he said, "I think I know what is coming. "Have a cigar, Chief," said Kent, "and let me hear what the trouble is." "Stop a bit," said Kent, pausing to think a moment. "Ha," said Kent, "a looloo!" The two men looked into one another''s eyes. "Now tell me," said Kent, as they stood beside the billiard table, "what "No," said Kent, taking her hand a moment, "you were not." id: 4682 author: Leacock, Stephen title: Nonsense Novels date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 6340 author: Leacock, Stephen title: Literary Lapses date: words: 42932.0 sentences: 3243.0 pages: flesch: 86.0 cache: ./cache/6340.txt txt: ./txt/6340.txt summary: "Girl," said the earl sternly, "I care not for the man''s trained by long years of high living and plain thinking, "Say good night!" they said, "why it''s only half-past One night I heard one man say, "Well, let''s call up New You know, many a man realizes late in life that if when not like to think of your pretty little letters lying old days a man was turned out thoroughly equipped after "Ah, statistics" said the other; "wonderful things, sir, But the Quick Man on the front seat said in a big whisper This time the thing seemed like a little round box. The great man is certainly a wonderful thing. old man put his hand on Smith''s head and say, mark his Times were bad with the old man. "And you know nothing of death, of course?" said the poet "Pardon," said the old man. ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel