mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-veronaItaly-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/23043.txt inflating: ./tmp/input/input-file/1509.txt inflating: ./tmp/input/input-file/1108.txt inflating: ./tmp/input/input-file/1112.txt inflating: ./tmp/input/input-file/1773.txt inflating: ./tmp/input/input-file/1777.txt inflating: ./tmp/input/input-file/2236.txt inflating: ./tmp/input/input-file/2261.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-veronaItaly-gutenberg FILE: cache/1509.txt OUTPUT: txt/1509.txt FILE: cache/2236.txt OUTPUT: txt/2236.txt FILE: cache/2261.txt OUTPUT: txt/2261.txt FILE: cache/1112.txt OUTPUT: txt/1112.txt FILE: cache/1773.txt OUTPUT: txt/1773.txt FILE: cache/1777.txt OUTPUT: txt/1777.txt FILE: cache/23043.txt OUTPUT: txt/23043.txt FILE: cache/1108.txt OUTPUT: txt/1108.txt === file2bib.sh === id: 2261 author: Shakespeare, William title: Romeo and Juliet date: pages: extension: .txt txt: ./txt/2261.txt cache: ./cache/2261.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'2261.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' === file2bib.sh === id: 1509 author: Shakespeare, William title: The Two Gentlemen of Verona date: pages: extension: .txt txt: ./txt/1509.txt cache: ./cache/1509.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'1509.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' === file2bib.sh === id: 2236 author: Shakespeare, William title: The Two Gentlemen of Verona date: pages: extension: .txt txt: ./txt/2236.txt cache: ./cache/2236.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'2236.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 1509 txt/../ent/1509.ent 1509 txt/../pos/1509.pos 1108 txt/../pos/1108.pos 1108 txt/../ent/1108.ent 2261 txt/../wrd/2261.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 1509 txt/../wrd/1509.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 2261 txt/../ent/2261.ent 2236 txt/../pos/2236.pos 1777 txt/../ent/1777.ent 2236 txt/../ent/2236.ent 1773 txt/../ent/1773.ent 1777 txt/../wrd/1777.wrd 2261 txt/../pos/2261.pos 2236 txt/../wrd/2236.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 1773 txt/../pos/1773.pos 1773 txt/../wrd/1773.wrd 1777 txt/../pos/1777.pos 1108 txt/../wrd/1108.wrd === file2bib.sh === id: 1108 author: Shakespeare, William title: The Two Gentlemen of Verona date: pages: extension: .txt txt: ./txt/1108.txt cache: ./cache/1108.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'1108.txt' === file2bib.sh === id: 1773 author: Shakespeare, William title: Two Gentlemen of Verona date: pages: extension: .txt txt: ./txt/1773.txt cache: ./cache/1773.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'1773.txt' === file2bib.sh === id: 1777 author: Shakespeare, William title: Romeo and Juliet date: pages: extension: .txt txt: ./txt/1777.txt cache: ./cache/1777.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'1777.txt' 23043 txt/../wrd/23043.wrd 1112 txt/../wrd/1112.wrd 23043 txt/../pos/23043.pos 1112 txt/../pos/1112.pos === file2bib.sh === id: 23043 author: Shakespeare, William title: Two Gentlemen of Verona The Works of William Shakespeare [Cambridge Edition] [9 vols.] date: pages: extension: .txt txt: ./txt/23043.txt cache: ./cache/23043.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'23043.txt' 23043 txt/../ent/23043.ent 1112 txt/../ent/1112.ent === file2bib.sh === id: 1112 author: Shakespeare, William title: The Tragedy of Romeo and Juliet date: pages: extension: .txt txt: ./txt/1112.txt cache: ./cache/1112.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 4 resourceName b'1112.txt' Done mapping. Reducing subject-veronaItaly-gutenberg === reduce.pl bib === id = 23043 author = Shakespeare, William title = Two Gentlemen of Verona The Works of William Shakespeare [Cambridge Edition] [9 vols.] date = pages = extension = .txt mime = text/plain words = 24080 sentences = 5621 flesch = 102 summary = _Pro._ Upon some book I love I'll pray for thee. _Val._ 'Tis true; for you are over boots in love, 25 _Jul._ What think'st thou of the fair Sir Eglamour? _Jul._ What think'st thou of the gentle Proteus? _Luc._ Sir Valentine's page; and sent, I think, from Proteus. _Ant._ Look, what thou want'st shall be sent after thee: _Val._ Go to, sir: tell me, do you know Madam Silvia? _Val._ But tell me, dost thou know my lady Silvia? _Speed._ True, sir; I was in love with my bed: I thank _Enter SILVIA, VALENTINE, THURIO, and SPEED._ _Val._ Why, lady, Love hath twenty pair of eyes. Ff. _in love, if thou wilt go_ Collier (Malone conj.). _Val._ I pray thee, Launce, an if thou seest my boy, _Duke._ Sir Thurio, fear not but that she will love you, _Pro._ Ay, gentle Thurio; for you know that love cache = ./cache/23043.txt txt = ./txt/23043.txt === reduce.pl bib === === reduce.pl bib === id = 1777 author = Shakespeare, William title = Romeo and Juliet date = pages = extension = .txt mime = text/plain words = 40 sentences = 10 flesch = 88 summary = THIS EBOOK WAS ONE OF PROJECT GUTENBERG'S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#1513) at https://www.gutenberg.org/ebooks/1513 cache = ./cache/1777.txt txt = ./txt/1777.txt === reduce.pl bib === id = 1112 author = Shakespeare, William title = The Tragedy of Romeo and Juliet date = pages = extension = .txt mime = text/plain words = 26475 sentences = 4001 flesch = 101 summary = Rom. What, shall I groan and tell thee? Ben. Why, Romeo, art thou mad? Jul. And stint thou too, I pray thee, nurse, say I. Rom. I take thee at thy word. Jul. What man art thou that, thus bescreen'd in night, Jul. Three words, dear Romeo, and good night indeed. Rom. Let me stand here till thou remember it. Rom. I'll tell thee ere thou ask it me again. Rom. What wilt thou tell her, nurse? Jul. Now, good sweet nurseO Lord, why look'st thou sad? Jul. I would thou hadst my bones, and I thy news. Rom. Tybalt, the reason that I have to love thee Wert thou as young as I, Juliet thy love, Jul. Art thou gone so, my lord, my love, my friend? Jul. Speak'st thou this from thy heart? To rouse thee from thy bed, there art thou dead. cache = ./cache/1112.txt txt = ./txt/1112.txt === reduce.pl bib === id = 1773 author = Shakespeare, William title = Two Gentlemen of Verona date = pages = extension = .txt mime = text/plain words = 40 sentences = 10 flesch = 88 summary = THIS EBOOK WAS ONE OF PROJECT GUTENBERG'S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#23043) at https://www.gutenberg.org/ebooks/23043 cache = ./cache/1773.txt txt = ./txt/1773.txt === reduce.pl bib === === reduce.pl bib === === reduce.pl bib === id = 1108 author = Shakespeare, William title = The Two Gentlemen of Verona date = pages = extension = .txt mime = text/plain words = 40 sentences = 10 flesch = 88 summary = THIS EBOOK WAS ONE OF PROJECT GUTENBERG'S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#23043) at https://www.gutenberg.org/ebooks/23043 cache = ./cache/1108.txt txt = ./txt/1108.txt Building ./etc/reader.txt Error: near line 1: database is locked Send options without primary recipient specified. Usage: mailx -eiIUdEFntBDNHRVv~ -T FILE -u USER -h hops -r address -s SUBJECT -a FILE -q FILE -f FILE -A ACCOUNT -b USERS -c USERS -S OPTION users 23043 1112 2261 23043 2261 2236 number of items: 8 sum of words: 50,675 average size in words: 10,135 average readability score: 93 nouns: love; man; night; death; sir; time; scene; day; lady; heart; art; thy; master; thee; exit; word; friend; conj; father; letter; men; wife; name; eyes; dog; nurse; life; hath; house; nothing; tears; servant; face; mother; eye; hand; gentleman; wit; daughter; bed; mistress; earth; world; son; news; mine; gentlemen; doth; thing; peace verbs: is; be; have; do; are; come; ''s; go; am; was; enter; say; see; let; were; know; make; give; did; tell; take; think; love; had; being; look; gone; been; speak; made; hear; comes; pray; call; said; makes; stand; find; hold; stay; leave; hath; done; live; put; die; thou; bid; says; get adjectives: good; more; sweet; fair; such; dead; true; much; own; other; same; old; young; poor; dear; gentle; little; many; holy; new; best; happy; full; very; long; high; great; false; black; better; last; first; rich; light; ill; heavy; worthy; pale; wise; wild; welcome; slow; mine; mad; less; desperate; blind; alone; worth; sure adverbs: not; so; now; then; here; too; as; up; out; yet; well; there; more; away; therefore; hence; again; never; much; even; still; ever; else; thus; very; indeed; down; back; no; most; soon; once; presently; in; early; all; long; off; forth; alone; tis; on; often; far; before; aside; late; better; only; rather pronouns: i; you; my; me; it; her; she; your; his; he; thy; him; thee; we; they; our; them; their; us; myself; mine; himself; thyself; one; yourself; yours; herself; itself; ''s; thou; ay; ''em; hers; theirs; ourselves; on''t; is''t; you,--; ours; me-; its; i-; faith- proper nouns: _; thou; jul.; rom; val; pro; romeo; speed; f1; nurse; f4; f3; f2; pope; sir; launce; valentine; friar; madam; silvia; duke; juliet; ben; mer; capell; sil; proteus; ff; hath; collier; cap; julia; enter; ms; luc; exeunt; capulet; thurio; lady; tybalt; prince; thu; hanmer; lord; paris; heaven; exit; .; god; om keywords: ebook; valentine; val; tybalt; thou; speed; romeo; prince; pope; paris; nurse; montague; mercutio; laurence; launce; july; juliet; friar; capulet one topic; one dimension: thou file(s): titles(s): The Two Gentlemen of Verona three topics; one dimension: thou; love; ebook file(s): ./cache/1112.txt, ./cache/23043.txt, ./cache/1108.txt titles(s): The Tragedy of Romeo and Juliet | Two Gentlemen of Verona The Works of William Shakespeare [Cambridge Edition] [9 vols.] | The Two Gentlemen of Verona five topics; three dimensions: thou thy rom; love _val _pro; ebook gutenberg 23043; 1513 early time; 1513 early time file(s): ./cache/1112.txt, ./cache/23043.txt, ./cache/1108.txt, , titles(s): The Tragedy of Romeo and Juliet | Two Gentlemen of Verona The Works of William Shakespeare [Cambridge Edition] [9 vols.] | The Two Gentlemen of Verona | The Two Gentlemen of Verona | The Two Gentlemen of Verona Type: gutenberg title: subject-veronaItaly-gutenberg date: 2021-06-10 time: 16:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Verona (Italy)" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 23043 author: Shakespeare, William title: Two Gentlemen of Verona The Works of William Shakespeare [Cambridge Edition] [9 vols.] date: words: 24080.0 sentences: 5621.0 pages: flesch: 102.0 cache: ./cache/23043.txt txt: ./txt/23043.txt summary: _Pro._ Upon some book I love I''ll pray for thee. _Val._ ''Tis true; for you are over boots in love, 25 _Jul._ What think''st thou of the fair Sir Eglamour? _Jul._ What think''st thou of the gentle Proteus? _Luc._ Sir Valentine''s page; and sent, I think, from Proteus. _Ant._ Look, what thou want''st shall be sent after thee: _Val._ Go to, sir: tell me, do you know Madam Silvia? _Val._ But tell me, dost thou know my lady Silvia? _Speed._ True, sir; I was in love with my bed: I thank _Enter SILVIA, VALENTINE, THURIO, and SPEED._ _Val._ Why, lady, Love hath twenty pair of eyes. Ff. _in love, if thou wilt go_ Collier (Malone conj.). _Val._ I pray thee, Launce, an if thou seest my boy, _Duke._ Sir Thurio, fear not but that she will love you, _Pro._ Ay, gentle Thurio; for you know that love id: 1509 author: Shakespeare, William title: The Two Gentlemen of Verona date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 1108 author: Shakespeare, William title: The Two Gentlemen of Verona date: words: 40.0 sentences: 10.0 pages: flesch: 88.0 cache: ./cache/1108.txt txt: ./txt/1108.txt summary: THIS EBOOK WAS ONE OF PROJECT GUTENBERG''S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#23043) at https://www.gutenberg.org/ebooks/23043 id: 1112 author: Shakespeare, William title: The Tragedy of Romeo and Juliet date: words: 26475.0 sentences: 4001.0 pages: flesch: 101.0 cache: ./cache/1112.txt txt: ./txt/1112.txt summary: Rom. What, shall I groan and tell thee? Ben. Why, Romeo, art thou mad? Jul. And stint thou too, I pray thee, nurse, say I. Rom. I take thee at thy word. Jul. What man art thou that, thus bescreen''d in night, Jul. Three words, dear Romeo, and good night indeed. Rom. Let me stand here till thou remember it. Rom. I''ll tell thee ere thou ask it me again. Rom. What wilt thou tell her, nurse? Jul. Now, good sweet nurseO Lord, why look''st thou sad? Jul. I would thou hadst my bones, and I thy news. Rom. Tybalt, the reason that I have to love thee Wert thou as young as I, Juliet thy love, Jul. Art thou gone so, my lord, my love, my friend? Jul. Speak''st thou this from thy heart? To rouse thee from thy bed, there art thou dead. id: 1773 author: Shakespeare, William title: Two Gentlemen of Verona date: words: 40.0 sentences: 10.0 pages: flesch: 88.0 cache: ./cache/1773.txt txt: ./txt/1773.txt summary: THIS EBOOK WAS ONE OF PROJECT GUTENBERG''S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#23043) at https://www.gutenberg.org/ebooks/23043 id: 1777 author: Shakespeare, William title: Romeo and Juliet date: words: 40.0 sentences: 10.0 pages: flesch: 88.0 cache: ./cache/1777.txt txt: ./txt/1777.txt summary: THIS EBOOK WAS ONE OF PROJECT GUTENBERG''S EARLY FILES PRODUCED AT A TIME WHEN PROOFING METHODS AND TOOLS WERE NOT WELL DEVELOPED. IS AN IMPROVED EDITION OF THIS TITLE WHICH MAY BE VIEWED AS EBOOK (#1513) at https://www.gutenberg.org/ebooks/1513 id: 2236 author: Shakespeare, William title: The Two Gentlemen of Verona date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: id: 2261 author: Shakespeare, William title: Romeo and Juliet date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel