mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-richardIiKingOfEngland-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/28433.txt inflating: ./tmp/input/input-file/1512.txt inflating: ./tmp/input/input-file/1111.txt inflating: ./tmp/input/input-file/1776.txt inflating: ./tmp/input/input-file/2250.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-richardIiKingOfEngland-gutenberg FILE: cache/28433.txt OUTPUT: txt/28433.txt FILE: cache/1512.txt OUTPUT: txt/1512.txt FILE: cache/2250.txt OUTPUT: txt/2250.txt FILE: cache/1111.txt OUTPUT: txt/1111.txt FILE: cache/1776.txt OUTPUT: txt/1776.txt === file2bib.sh === id: 2250 author: Shakespeare, William title: Richard II date: pages: extension: .txt txt: ./txt/2250.txt cache: ./cache/2250.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'2250.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' === file2bib.sh === id: 1776 author: Shakespeare, William title: King Richard II date: pages: extension: .txt txt: ./txt/1776.txt cache: ./cache/1776.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'1776.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' === file2bib.sh === id: 1512 author: Shakespeare, William title: The Tragedy of King Richard the Second date: pages: extension: .txt txt: ./txt/1512.txt cache: ./cache/1512.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'1512.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 1512 txt/../wrd/1512.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 1111 txt/../ent/1111.ent 2250 txt/../pos/2250.pos 1111 txt/../pos/1111.pos === file2bib.sh === id: 1111 author: Shakespeare, William title: King Richard the Second date: pages: extension: .txt txt: ./txt/1111.txt cache: ./cache/1111.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'1111.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 1512 txt/../pos/1512.pos 1776 txt/../ent/1776.ent 1776 txt/../pos/1776.pos 2250 txt/../ent/2250.ent 1512 txt/../ent/1512.ent 1776 txt/../wrd/1776.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 1111 txt/../wrd/1111.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 2250 txt/../wrd/2250.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 28433 txt/../pos/28433.pos 28433 txt/../wrd/28433.wrd 28433 txt/../ent/28433.ent === file2bib.sh === id: 28433 author: Abbott, Jacob title: Richard II Makers of History date: pages: extension: .txt txt: ./txt/28433.txt cache: ./cache/28433.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 5 resourceName b'28433.txt' Done mapping. Reducing subject-richardIiKingOfEngland-gutenberg === reduce.pl bib === === reduce.pl bib === === reduce.pl bib === === reduce.pl bib === id = 28433 author = Abbott, Jacob title = Richard II Makers of History date = pages = extension = .txt mime = text/plain words = 65561 sentences = 3022 flesch = 75 summary = King Richard the Second lived in the days when the chivalry of feudal and the king immediately sent a troop of armed men, with an earl at The father of King Richard the Second was a celebrated Prince of his father, King Edward, died, Richard, who was the oldest son of the King of England, Edward the Third, the father of the Black Prince, Prince.--The country laid waste.--The King of France comes to meet the king's sons.--The victory announced to the prince.--The men called the prisoner.--The war ended.--The king ransomed.--Prince Edward's attempted to conduct the king to Prince Edward, all the knights of the King of France as prisoner to England, had reached London, and though his father, Prince Edward, was the oldest son of the King of though his father, Prince Edward, was the oldest son of the King of were then residing; for all this took place just before King Richard's cache = ./cache/28433.txt txt = ./txt/28433.txt === reduce.pl bib === Building ./etc/reader.txt 28433 2250 1776 28433 2250 1776 number of items: 5 sum of words: 65,561 average size in words: 65,561 average readability score: 75 nouns: king; time; men; prince; people; nobles; castle; way; place; knights; man; country; head; day; army; years; days; son; length; course; father; queen; power; part; barons; manner; death; river; party; government; town; crown; court; order; kings; insurgents; body; state; side; war; case; thing; name; kingdom; respect; palace; battle; mother; marriage; princess verbs: was; were; had; be; is; been; said; made; have; came; took; called; went; come; are; do; sent; go; did; began; being; make; set; see; brought; found; having; take; put; received; died; named; taken; going; became; thought; let; give; heard; gave; led; engaged; remained; appointed; killed; has; say; formed; done; determined adjectives: great; other; many; young; own; french; english; such; little; large; certain; more; long; whole; same; several; ready; old; present; mean; last; good; royal; high; various; grand; first; next; immense; possible; ancient; true; personal; necessary; full; common; angry; public; new; different; armed; much; vast; principal; general; greatest; short; pleased; open; cruel adverbs: not; so; very; then; out; up; however; now; soon; more; thus; down; as; there; too; only; immediately; away; all; on; well; much; off; most; still; back; also; again; sometimes; in; here; afterward; often; greatly; about; once; together; first; always; accordingly; far; ever; almost; never; indeed; long; forward; home; finally; yet pronouns: he; his; they; it; him; them; their; her; you; she; i; himself; we; themselves; my; our; me; your; us; its; itself; yourself; ourselves; one; theirs; myself; mine; yours; herself; prisoner.--his; him.--his proper nouns: richard; edward; england; king; france; london; john; prince; _; duke; henry; wales; lord; lancaster; arthur; walter; english; sir; gaveston; philip; black; tower; french; anne; parliament; evan; castle; paris; de; god; earl; calais; mortimer; isabella; pope; ralph; langurant; gloucester; bordeaux; westminster; lamb; holland; aquitaine; queen; d''albret; stafford; chapter; a.d.; pedro; leolin keywords: wales; richard; prince; lord; london; lancaster; king; john; illustration; henry; french; france; english; england; edward; duke; arthur one topic; one dimension: king file(s): titles(s): The Tragedy of King Richard the Second three topics; one dimension: king; 103; 103 file(s): ./cache/28433.txt, , titles(s): Richard II Makers of History | The Tragedy of King Richard the Second | The Tragedy of King Richard the Second five topics; three dimensions: king richard time; 103 curtailed conflicted; 103 curtailed conflicted; 103 curtailed conflicted; 103 curtailed conflicted file(s): ./cache/28433.txt, , , , titles(s): Richard II Makers of History | The Tragedy of King Richard the Second | The Tragedy of King Richard the Second | The Tragedy of King Richard the Second | The Tragedy of King Richard the Second Type: gutenberg title: subject-richardIiKingOfEngland-gutenberg date: 2021-06-09 time: 18:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Richard II, King of England, 1367-1400" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 28433 author: Abbott, Jacob title: Richard II Makers of History date: words: 65561.0 sentences: 3022.0 pages: flesch: 75.0 cache: ./cache/28433.txt txt: ./txt/28433.txt summary: King Richard the Second lived in the days when the chivalry of feudal and the king immediately sent a troop of armed men, with an earl at The father of King Richard the Second was a celebrated Prince of his father, King Edward, died, Richard, who was the oldest son of the King of England, Edward the Third, the father of the Black Prince, Prince.--The country laid waste.--The King of France comes to meet the king''s sons.--The victory announced to the prince.--The men called the prisoner.--The war ended.--The king ransomed.--Prince Edward''s attempted to conduct the king to Prince Edward, all the knights of the King of France as prisoner to England, had reached London, and though his father, Prince Edward, was the oldest son of the King of though his father, Prince Edward, was the oldest son of the King of were then residing; for all this took place just before King Richard''s id: 1512 author: Shakespeare, William title: The Tragedy of King Richard the Second date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 1111 author: Shakespeare, William title: King Richard the Second date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 1776 author: Shakespeare, William title: King Richard II date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 2250 author: Shakespeare, William title: Richard II date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel