mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-namesPersonal-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/24374.txt inflating: ./tmp/input/input-file/37520.txt inflating: ./tmp/input/input-file/39284.txt inflating: ./tmp/input/input-file/34215.txt inflating: ./tmp/input/input-file/47627.txt inflating: ./tmp/input/input-file/51210.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-namesPersonal-gutenberg FILE: cache/24374.txt OUTPUT: txt/24374.txt FILE: cache/39284.txt OUTPUT: txt/39284.txt FILE: cache/47627.txt OUTPUT: txt/47627.txt FILE: cache/51210.txt OUTPUT: txt/51210.txt FILE: cache/37520.txt OUTPUT: txt/37520.txt FILE: cache/34215.txt OUTPUT: txt/34215.txt 24374 txt/../ent/24374.ent 24374 txt/../pos/24374.pos === file2bib.sh === id: 24374 author: Weekley, Ernest title: The Romance of Names date: pages: extension: .txt txt: ./txt/24374.txt cache: ./cache/24374.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 1 resourceName b'24374.txt' Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/file2bib.py", line 107, in text = textacy.preprocessing.normalize.normalize_quotation_marks( text ) File "/data-disk/python/lib/python3.8/site-packages/textacy/preprocessing/normalize.py", line 32, in normalize_quotation_marks return text.translate(QUOTE_TRANSLATION_TABLE) AttributeError: 'NoneType' object has no attribute 'translate' 24374 txt/../wrd/24374.wrd Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/txt2keywords.py", line 54, in for keyword, score in ( yake( doc, ngrams=NGRAMS, topn=TOPN ) ) : File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 96, in yake word_scores = _compute_word_scores(doc, word_occ_vals, word_freqs, stop_words) File "/data-disk/python/lib/python3.8/site-packages/textacy/ke/yake.py", line 205, in _compute_word_scores freq_baseline = statistics.mean(freqs_nsw) + statistics.stdev(freqs_nsw) File "/data-disk/python/lib/python3.8/statistics.py", line 315, in mean raise StatisticsError('mean requires at least one data point') statistics.StatisticsError: mean requires at least one data point 51210 txt/../wrd/51210.wrd 51210 txt/../pos/51210.pos === file2bib.sh === id: 51210 author: Sheldon, Walter J. title: I, the Unspeakable date: pages: extension: .txt txt: ./txt/51210.txt cache: ./cache/51210.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 3 resourceName b'51210.txt' 51210 txt/../ent/51210.ent 34215 txt/../wrd/34215.wrd 34215 txt/../pos/34215.pos 39284 txt/../pos/39284.pos 39284 txt/../wrd/39284.wrd 34215 txt/../ent/34215.ent 47627 txt/../pos/47627.pos 47627 txt/../wrd/47627.wrd 37520 txt/../pos/37520.pos 37520 txt/../wrd/37520.wrd 47627 txt/../ent/47627.ent 39284 txt/../ent/39284.ent === file2bib.sh === id: 34215 author: Hearn, Lafcadio title: Shadowings date: pages: extension: .txt txt: ./txt/34215.txt cache: ./cache/34215.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 6 resourceName b'34215.txt' 37520 txt/../ent/37520.ent === file2bib.sh === id: 39284 author: Bardsley, Charles Wareing Endell title: Curiosities of Puritan Nomenclature date: pages: extension: .txt txt: ./txt/39284.txt cache: ./cache/39284.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 5 resourceName b'39284.txt' === file2bib.sh === id: 47627 author: Pickett, Thomas Edward title: The Quest for a Lost Race date: pages: extension: .txt txt: ./txt/47627.txt cache: ./cache/47627.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 6 resourceName b'47627.txt' === file2bib.sh === id: 37520 author: Ferguson, Robert title: Surnames as a Science date: pages: extension: .txt txt: ./txt/37520.txt cache: ./cache/37520.txt Content-Encoding UTF-8 Content-Type text/plain; charset=UTF-8 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 7 resourceName b'37520.txt' Done mapping. Reducing subject-namesPersonal-gutenberg === reduce.pl bib === === reduce.pl bib === id = 39284 author = Bardsley, Charles Wareing Endell title = Curiosities of Puritan Nomenclature date = pages = extension = .txt mime = text/plain words = 56082 sentences = 5045 flesch = 82 summary = the Puritan incumbent, should have baptized his own children by such names of English surnames and baptismal names might be written. the old English names had gone down before the year 1200 had been reached. document containing 588 names, 92 are William, 88 John, 55 Richard, 48 upon as altered forms of old favourite names, and were entered in vestry Ann, in these days of double baptismal names, perpetuates the impression that Marion or Marian was compounded of Mary and Ann. Of familiar occurrence were such names as _Perrin_, from Pierre, Peter; following _surnames_ (originally, of course, christian names) from the became household names, John, Simon, Peter, Bartholomew, Matthew, James, old Scripture names of Bartholomew, Peter, Philip, and Nicholas received a popular feeling for a century was against turning the new Scripture names baptized in England, thirteen are entered in the register as John or cache = ./cache/39284.txt txt = ./txt/39284.txt === reduce.pl bib === id = 37520 author = Ferguson, Robert title = Surnames as a Science date = pages = extension = .txt mime = text/plain words = 58598 sentences = 5687 flesch = 82 summary = also common as the endings of Celtic names, _ward_ taking the form of German form as _Sycamore_, the Anglo-Saxon names from which they may be CLUE TO SOME OF THE ANCIENT FORMS REPRESENTED IN ENGLISH NAMES. CLUE TO SOME OF THE ANCIENT FORMS REPRESENTED IN ENGLISH NAMES. Now ancient Teutonic names formed of one single word had commonly, In many cases in Teutonic names we have words thus formed, and also the English names, with the ancient forms corresponding. should, in names of Teutonic origin, exhibit High German forms in The High German forms, then, that appear in English names may be taken Besides the names of Old Frankish, _i.e._ German origin, which have come names of Anglo-Saxon times, the form _ch_ for (as I suppose) _g_, as in in Anglo-Saxon times, nor anything to correspond in Old German names. besides other names in correspondence with ancient forms. cache = ./cache/37520.txt txt = ./txt/37520.txt === reduce.pl bib === id = 34215 author = Hearn, Lafcadio title = Shadowings date = pages = extension = .txt mime = text/plain words = 40208 sentences = 4027 flesch = 84 summary = sang was an old Japanese song about a famous shrine in the town of the old man went away as he had come; and the young girl followed him. He is a good young man; and later in life he will obtain a much higher THERE was a man named Tawaraya Tôtarô, who lived in the Province of Ômi. Here I may remark that Japanese children usually capture sémi by means word sémi to names of insects which are not cicadæ. the same kind of sémi may be called by different names in different attached to the following examples are nearly all names of old-time A very large number of Japanese poems about sémi describe the noise of BY the Japanese a certain kind of girl is called a still do, that Japanese girls are usually named after flowers, or an old rule for Japanese names,--a curious rule that might help to cache = ./cache/34215.txt txt = ./txt/34215.txt === reduce.pl bib === id = 47627 author = Pickett, Thomas Edward title = The Quest for a Lost Race date = pages = extension = .txt mime = text/plain words = 58706 sentences = 4853 flesch = 71 summary = of England and in the authentic annals of the Anglo-Norman races. the Norman to English soil, in time drove him to the great settlements derivation from the Anglo-Norman branch of the great British race. Norman to the English race in England and the United States. of England and the founder of the Anglo-Norman race that swore the the simpler forms of profanity--Anglo-Norman and Early English. When she lost the Norman element in its early Scandinavian form, her scholar, the great English writer--himself of Anglo-Norman blood--found royal Anglo-Norman, "Prince Hal" of England, the English dramatist _Anglo-Saxon Race_,--which in the great Triple Alliance of Norman and Scandinavian stock; the Norman from Normandy, remotely Gothic, is Normans, but broadly speaking, are a great branch of the English race Kentucky derived from English sources and bearing Norman surnames is _Bagot._ A baronial family (Normandy); came to England at the Norman family is readily traceable from Normandy to England, and cache = ./cache/47627.txt txt = ./txt/47627.txt === reduce.pl bib === id = 51210 author = Sheldon, Walter J. title = I, the Unspeakable date = pages = extension = .txt mime = text/plain words = 11967 sentences = 1336 flesch = 93 summary = Like most important places, the Govpub Office in Center Four was I started to turn away and the cyb said, "Information on tanks is I felt like anything but standing there and looking lonely working here?" Personal talk at a time like this wasn't approved We came to a turn in the corridor and something happened; I'm not sure I walked out and wanted to turn and smile at Lara, and get into my "_The woman, Lara, attracts you_," said the voice. "Of course," I said again, and went back to washing my hands. looked at me and said, in approved voice and standard phraseology, Activity Control said they couldn't do a thing until I was registered. "A spy," said Apollo, looking into my open eyes. "I don't," said the Chief, and got up. It was the first time I had heard his voice. cache = ./cache/51210.txt txt = ./txt/51210.txt Building ./etc/reader.txt Error: near line 1: database is locked Send options without primary recipient specified. Usage: mailx -eiIUdEFntBDNHRVv~ -T FILE -u USER -h hops -r address -s SUBJECT -a FILE -q FILE -f FILE -A ACCOUNT -b USERS -c USERS -S OPTION users 37520 39284 47627 47627 37520 51210 number of items: 6 sum of words: 225,561 average size in words: 45,112 average readability score: 82 nouns: name; names; form; man; time; family; day; race; years; origin; century; p.; word; place; son; life; men; forms; case; sense; war; daughter; people; stem; way; times; list; meaning; warrior; wife; instance; ending; woman; surnames; child; cases; world; words; children; rule; blood; something; one; history; days; work; d.; voice; nothing; surname verbs: is; was; be; have; are; had; were; been; has; found; take; being; seems; said; find; do; did; made; called; says; came; come; think; seem; given; taken; derived; became; say; see; know; used; corresponding; having; known; seen; died; referred; become; formed; am; supposed; make; following; signifying; born; give; go; went; identified adjectives: same; old; other; great; such; english; many; ancient; german; common; first; little; more; early; famous; own; good; present; certain; last; scandinavian; japanese; frankish; popular; familiar; few; original; high; different; new; several; anglo; large; french; second; least; modern; long; curious; able; young; strong; norman; christian; full; general; much; possible; latter; personal adverbs: not; so; also; only; now; more; then; even; very; still; probably; as; again; perhaps; n''t; most; here; well; up; thus; never; out; too; there; however; far; no; rather; once; much; almost; sometimes; always; back; yet; hence; just; in; down; already; all; armorially; on; first; indeed; long; especially; ever; away; later pronouns: it; i; his; he; we; they; their; you; our; her; my; its; me; she; him; them; us; your; himself; itself; myself; themselves; one; thy; thee; ya; ourselves; ''em; herself; mine; thyself; ''s; yourself; yours; ye; oneself; iv; æs; zo; yt; you,"--this; yorkshire; yoi; yankee"--the; ya:_--; theirs; tank[6; say--"in; ours; o proper nouns: _; o.g.; eng; england; normandy; a.s.; norman; o; john; f.; william; de; anglo; saxon; english; kentucky; baptized; thomas; mr.; ko; i.; vide; robert; richard; henry; puritan; london; sir; hari; st.; names; man; french; peter; l.v.; france; s.; lord; james; d.; elizabeth; buried; virginia; old; god; charles; lib; mary; le; century keywords: mr.; english; england; william; thomas; st.; sir; saxon; robert; richard; old; man; john; french; anglo; zémi; yanrei; worc; wine; wig; washington; ward; wald; vitæ; virginia; vide; united; tôtarô; tokkei; time; thousand; teutonic; sémi; states; state; september; seiza; scandinavian; san; roman; ric; race; puritan; province; october; o.n.; o.h.g.; o.g.; november; north one topic; one dimension: names file(s): titles(s): The Romance of Names three topics; one dimension: names; normandy; eng file(s): ./cache/39284.txt, ./cache/47627.txt, ./cache/37520.txt titles(s): Curiosities of Puritan Nomenclature | The Quest for a Lost Race | Surnames as a Science five topics; three dimensions: normandy norman _o; names baptized john; eng names german; arrangement wound passes; arrangement wound passes file(s): ./cache/47627.txt, ./cache/39284.txt, ./cache/37520.txt, , titles(s): The Quest for a Lost Race | Curiosities of Puritan Nomenclature | Surnames as a Science | The Romance of Names | The Romance of Names Type: gutenberg title: subject-namesPersonal-gutenberg date: 2021-06-07 time: 12:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Names, Personal" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 39284 author: Bardsley, Charles Wareing Endell title: Curiosities of Puritan Nomenclature date: words: 56082.0 sentences: 5045.0 pages: flesch: 82.0 cache: ./cache/39284.txt txt: ./txt/39284.txt summary: the Puritan incumbent, should have baptized his own children by such names of English surnames and baptismal names might be written. the old English names had gone down before the year 1200 had been reached. document containing 588 names, 92 are William, 88 John, 55 Richard, 48 upon as altered forms of old favourite names, and were entered in vestry Ann, in these days of double baptismal names, perpetuates the impression that Marion or Marian was compounded of Mary and Ann. Of familiar occurrence were such names as _Perrin_, from Pierre, Peter; following _surnames_ (originally, of course, christian names) from the became household names, John, Simon, Peter, Bartholomew, Matthew, James, old Scripture names of Bartholomew, Peter, Philip, and Nicholas received a popular feeling for a century was against turning the new Scripture names baptized in England, thirteen are entered in the register as John or id: 37520 author: Ferguson, Robert title: Surnames as a Science date: words: 58598.0 sentences: 5687.0 pages: flesch: 82.0 cache: ./cache/37520.txt txt: ./txt/37520.txt summary: also common as the endings of Celtic names, _ward_ taking the form of German form as _Sycamore_, the Anglo-Saxon names from which they may be CLUE TO SOME OF THE ANCIENT FORMS REPRESENTED IN ENGLISH NAMES. CLUE TO SOME OF THE ANCIENT FORMS REPRESENTED IN ENGLISH NAMES. Now ancient Teutonic names formed of one single word had commonly, In many cases in Teutonic names we have words thus formed, and also the English names, with the ancient forms corresponding. should, in names of Teutonic origin, exhibit High German forms in The High German forms, then, that appear in English names may be taken Besides the names of Old Frankish, _i.e._ German origin, which have come names of Anglo-Saxon times, the form _ch_ for (as I suppose) _g_, as in in Anglo-Saxon times, nor anything to correspond in Old German names. besides other names in correspondence with ancient forms. id: 34215 author: Hearn, Lafcadio title: Shadowings date: words: 40208.0 sentences: 4027.0 pages: flesch: 84.0 cache: ./cache/34215.txt txt: ./txt/34215.txt summary: sang was an old Japanese song about a famous shrine in the town of the old man went away as he had come; and the young girl followed him. He is a good young man; and later in life he will obtain a much higher THERE was a man named Tawaraya Tôtarô, who lived in the Province of Ômi. Here I may remark that Japanese children usually capture sémi by means word sémi to names of insects which are not cicadæ. the same kind of sémi may be called by different names in different attached to the following examples are nearly all names of old-time A very large number of Japanese poems about sémi describe the noise of BY the Japanese a certain kind of girl is called a still do, that Japanese girls are usually named after flowers, or an old rule for Japanese names,--a curious rule that might help to id: 47627 author: Pickett, Thomas Edward title: The Quest for a Lost Race date: words: 58706.0 sentences: 4853.0 pages: flesch: 71.0 cache: ./cache/47627.txt txt: ./txt/47627.txt summary: of England and in the authentic annals of the Anglo-Norman races. the Norman to English soil, in time drove him to the great settlements derivation from the Anglo-Norman branch of the great British race. Norman to the English race in England and the United States. of England and the founder of the Anglo-Norman race that swore the the simpler forms of profanity--Anglo-Norman and Early English. When she lost the Norman element in its early Scandinavian form, her scholar, the great English writer--himself of Anglo-Norman blood--found royal Anglo-Norman, "Prince Hal" of England, the English dramatist _Anglo-Saxon Race_,--which in the great Triple Alliance of Norman and Scandinavian stock; the Norman from Normandy, remotely Gothic, is Normans, but broadly speaking, are a great branch of the English race Kentucky derived from English sources and bearing Norman surnames is _Bagot._ A baronial family (Normandy); came to England at the Norman family is readily traceable from Normandy to England, and id: 51210 author: Sheldon, Walter J. title: I, the Unspeakable date: words: 11967.0 sentences: 1336.0 pages: flesch: 93.0 cache: ./cache/51210.txt txt: ./txt/51210.txt summary: Like most important places, the Govpub Office in Center Four was I started to turn away and the cyb said, "Information on tanks is I felt like anything but standing there and looking lonely working here?" Personal talk at a time like this wasn''t approved We came to a turn in the corridor and something happened; I''m not sure I walked out and wanted to turn and smile at Lara, and get into my "_The woman, Lara, attracts you_," said the voice. "Of course," I said again, and went back to washing my hands. looked at me and said, in approved voice and standard phraseology, Activity Control said they couldn''t do a thing until I was registered. "A spy," said Apollo, looking into my open eyes. "I don''t," said the Chief, and got up. It was the first time I had heard his voice. id: 24374 author: Weekley, Ernest title: The Romance of Names date: words: nan sentences: nan pages: flesch: nan cache: txt: summary: ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel