mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file
Creating study carrel named dickinson-from-gutenberg
Initializing database
Unzipping
Archive: input-file.zip
creating: ./tmp/input/input-file/
inflating: ./tmp/input/input-file/2678.txt
inflating: ./tmp/input/input-file/2679.txt
inflating: ./tmp/input/input-file/12242.txt
inflating: ./tmp/input/input-file/metadata.csv
caution: excluded filename not matched: *MACOSX*
=== updating bibliographic database
Building study carrel named dickinson-from-gutenberg
FILE: cache/2678.txt OUTPUT: txt/2678.txt
FILE: cache/2679.txt OUTPUT: txt/2679.txt
FILE: cache/12242.txt OUTPUT: txt/12242.txt
=== file2bib.sh ===
id: 2678
author: Dickinson, Emily
title: Poems by Emily Dickinson, Series One
date:
pages:
extension: .txt
txt: ./txt/2678.txt
cache: ./cache/2678.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 1
resourceName b'2678.txt'
Traceback (most recent call last):
  File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in <module>
    text = textacy.preprocessing.normalize.normalize_quotation_marks( text )
  File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks
    return text.translate(QUOTE_TRANSLATION_TABLE)
AttributeError: 'NoneType' object has no attribute 'translate'
2679 txt/../wrd/2679.wrd
Traceback (most recent call last):
  File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in <module>
    for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) :
  File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake
    word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words)
  File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores
    freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw)
  File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean
    raise StatisticsError('mean requires at least one data point')
statistics.StatisticsError: mean requires at least one data point
=== file2bib.sh ===
id: 2679
author: Dickinson, Emily
title: Poems by Emily Dickinson, Series Two
date:
pages:
extension: .txt
txt: ./txt/2679.txt
cache: ./cache/2679.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 1
resourceName b'2679.txt'
Traceback (most recent call last):
  File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in <module>
    text = textacy.preprocessing.normalize.normalize_quotation_marks( text )
  File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks
    return text.translate(QUOTE_TRANSLATION_TABLE)
AttributeError: 'NoneType' object has no attribute 'translate'
2678 txt/../wrd/2678.wrd
Traceback (most recent call last):
  File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in <module>
    for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) :
  File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake
    word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words)
  File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores
    freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw)
  File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean
    raise StatisticsError('mean requires at least one data point')
statistics.StatisticsError: mean requires at least one data point
2679 txt/../pos/2679.pos
2678 txt/../pos/2678.pos
2679 txt/../ent/2679.ent
2678 txt/../ent/2678.ent
12242 txt/../pos/12242.pos
12242 txt/../wrd/12242.wrd
12242 txt/../ent/12242.ent
=== file2bib.sh ===
id: 12242
author: Dickinson, Emily
title: Poems by Emily Dickinson, Three Series, Complete
date:
pages:
extension: .txt
txt: ./txt/12242.txt
cache: ./cache/12242.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 3
resourceName b'12242.txt'
Done mapping.
Reducing dickinson-from-gutenberg
=== reduce.pl bib ===
id = 12242
author = Dickinson, Emily
title = Poems by Emily Dickinson, Three Series, Complete
date =
pages =
extension = .txt
mime = text/plain
words = 31219
sentences = 4223
flesch = 95
summary = The little toil of love, I thought, Beware, lest this little brook of life And thread the dews all night, like pearls, And dream the days away, -Like flowers that heard the tale of dews, The wind does, working like a hand These are the days when birds come back, I think the hemlock likes to stand Look back on time with kindly eyes, I shall know why, when time is over, Storm, wind, the wild March sky, sunsets and dawns; the birds and A thought went up my mind to-day I many times thought peace had come, The day came slow, till five o'clock, Belles from some lost summer day, They looked like frightened beads, I thought; And I 'd like to look a little more Noons like these she rose, It's like the bee, -It's like the morning, -We only know what time of year
cache = ./cache/12242.txt
txt = ./txt/12242.txt
=== reduce.pl bib ===
=== reduce.pl bib ===
Building ./etc/reader.txt
12242 2679 2678 12242 2679 2678
number of items: 3
sum of words: 31,219
average size in words: 31,219
average readability score: 95
nouns: day; life; time; t; night; summer; sun; death; face; soul; sea; eyes; way; morning; feet; door; heart; sky; one; air; men; noon; bird; bee; nature; hand; friend; fingers; birds; man; love; house; thing; dew; world; light; grass; flower; eye; year; tree; place; name; thought; poems; breath; angels; land; flowers; ear
verbs: is; was; be; have; had; are; were; know; tell; ''s; see; has; did; go; do; come; went; say; look; been; put; take; lost; done; came; thought; passed; made; seen; heard; die; said; knew; find; make; am; gone; died; stand; left; keep; hear; going; goes; touch; think; meet; ''m; wonder; took
adjectives: little; other; many; new; old; sweet; such; last; first; own; few; purple; more; small; same; simple; full; best; dead; single; much; low; long; blue; good; white; human; fair; unknown; red; quiet; narrow; mighty; large; imperial; happy; great; distant; common; yellow; wild; true; next; golden; foreign; early; cool; ample; timid; sure
adverbs: not; so; then; just; too; away; never; still; as; now; yet; more; there; out; only; down; here; up; n''t; far; again; once; back; before; almost; no; softly; often; first; even; by; well; very; on; long; in; abroad; off; enough; all; sometimes; perhaps; below; always; slow; home; everywhere; ever; alone; twice
pronouns: i; it; my; her; me; you; his; we; he; they; she; their; its; him; our; them; your; us; thee; myself; itself; mine; one; himself; thy; herself; themselves; ourselves; yours; thyself; ourself; yourself; pelf; father''d
proper nouns: heaven; t; god; thou; _; i.; iv; emily; dickinson; nature; xvi; xv; xiv; xiii; xii; xi; x.; viii; vii; vi; xvii; march; xxii; xxi; xx; xix; v.; paradise; bee; xxvi; xxv; xxiv; xxiii; time; xxviii; xxvii; xxix; love; xxxi; xxx; lord; father; eden; xxxviii; xxxvii; xxxvi; xxxv; xxxiv; xxxiii; xxxii
keywords: vii; time; summer; pass; look; little; like; life; god; dickinson; death; day; come
one topic; one dimension: like
file(s):
titles(s): Poems by Emily Dickinson, Series One
three topics; one dimension: like; zone; zone
file(s): ./cache/12242.txt, ,
titles(s): Poems by Emily Dickinson, Three Series, Complete | Poems by Emily Dickinson, Series One | Poems by Emily Dickinson, Series One
five topics; three dimensions: like little day; zone gurgle hame; zone gurgle hame; zone gurgle hame; zone gurgle hame
file(s): ./cache/12242.txt, , , ,
titles(s): Poems by Emily Dickinson, Three Series, Complete | Poems by Emily Dickinson, Series One | Poems by Emily Dickinson, Series One | Poems by Emily Dickinson, Series One | Poems by Emily Dickinson, Series One
Type: gutenberg
title: dickinson-from-gutenberg
date: 2021-01-09
time: 15:30
username: emorgan
patron: Eric Morgan
email: emorgan@nd.edu
input: author:"Dickinson, Emily" NOT 12241
==== make-pages.sh htm files
==== make-pages.sh complex files
==== make-pages.sh named enities
==== making bibliographics
id: 2678
author: Dickinson, Emily
title: Poems by Emily Dickinson, Series One
date:
words: nan
sentences: nan
pages:
flesch: nan
cache:
txt:
summary:
id: 2679
author: Dickinson, Emily
title: Poems by Emily Dickinson, Series Two
date:
words: nan
sentences: nan
pages:
flesch: nan
cache:
txt:
summary:
id: 12242
author: Dickinson, Emily
title: Poems by Emily Dickinson, Three Series, Complete
date:
words: 31219.0
sentences: 4223.0
pages:
flesch: 95.0
cache: ./cache/12242.txt
txt: ./txt/12242.txt
summary: The little toil of love, I thought, Beware, lest this little brook of life And thread the dews all night, like pearls, And dream the days away, -Like flowers that heard the tale of dews, The wind does, working like a hand These are the days when birds come back, I think the hemlock likes to stand Look back on time with kindly eyes, I shall know why, when time is over, Storm, wind, the wild March sky, sunsets and dawns; the birds and A thought went up my mind to-day I many times thought peace had come, The day came slow, till five o''clock, Belles from some lost summer day, They looked like frightened beads, I thought; And I ''d like to look a little more Noons like these she rose, It''s like the bee, -It''s like the morning, -We only know what time of year
==== make-pages.sh questions
==== make-pages.sh search
==== make-pages.sh topic modeling corpus
Zipping study carrel