mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file Creating study carrel named subject-huntingGuides-gutenberg Initializing database Unzipping Archive: input-file.zip creating: ./tmp/input/input-file/ inflating: ./tmp/input/input-file/1918.txt inflating: ./tmp/input/input-file/metadata.csv caution: excluded filename not matched: *MACOSX* === DIRECTORIES: ./tmp/input === DIRECTORY: ./tmp/input/input-file === metadata file: ./tmp/input/input-file/metadata.csv === found metadata file === updating bibliographic database Building study carrel named subject-huntingGuides-gutenberg FILE: cache/1918.txt OUTPUT: txt/1918.txt 1918 txt/../wrd/1918.wrd 1918 txt/../pos/1918.pos 1918 txt/../ent/1918.ent === file2bib.sh === id: 1918 author: Haggard, H. Rider (Henry Rider) title: Long Odds date: pages: extension: .txt txt: ./txt/1918.txt cache: ./cache/1918.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 2 resourceName b'1918.txt' Done mapping. Reducing subject-huntingGuides-gutenberg === reduce.pl bib === id = 1918 author = Haggard, H. Rider (Henry Rider) title = Long Odds date = pages = extension = .txt mime = text/plain words = 6509 sentences = 315 flesch = 86 summary = interior, and so I started with a waggon-load of goods, and came and round granite koppies starting up here and there, looking out like fool I got down off the waggon-box to have a look round, thinking it up I heard the lion behind me, and next second I felt the brute, ay, as lion I ever saw, and I have seen a great many, and he had a most "The lions came back no more that night, and by the next morning my Accordingly Tom took some matches and began starting little fires to like a fan, whereupon I went round to the further side of the pan to the lion, like the lamb of prophesy, but I suppose the reeds were thick, got my gun well on to the lion's shoulder--the black-maned one--so as to less just in time to see the tail of the last lion vanishing round the cache = ./cache/1918.txt txt = ./txt/1918.txt Building ./etc/reader.txt 1918 1918 number of items: 1 sum of words: 6,509 average size in words: 6,509 average readability score: 86 nouns: lion; waggon; lions; way; reeds; pan; man; yards; lioness; kloof; gun; eyes; time; thing; shot; night; head; fever; feet; day; bush; back; air; tail; something; sight; place; people; moment; leg; woman; sound; oxen; ox; hand; eye; death; cartridge; bushes; bullet; blood; animal; wrist; word; wind; top; tongue; thigh; teeth; story verbs: was; had; have; is; got; be; were; did; came; went; get; been; see; heard; go; began; took; made; turned; thought; saw; look; has; come; told; lay; going; found; stood; seen; said; put; killed; gone; done; do; being; think; started; say; making; looked; knew; having; gave; are; used; standing; shot; remember adjectives: old; great; little; other; dead; more; long; last; next; good; yellow; half; first; poor; few; whole; white; second; round; beautiful; young; wild; sure; sudden; single; same; right; new; many; low; large; hot; green; full; dry; dark; black; bad; awful; tough; thicker; thick; stronger; splendid; soft; risky; restless; possible; own; only adverbs: not; up; out; so; then; very; there; down; just; back; about; again; well; on; now; as; pretty; only; never; more; still; right; away; too; ever; all; round; suddenly; quite; off; in; however; straight; soon; slowly; rather; presently; here; accordingly; wonderfully; together; somewhere; nearly; n''t; indeed; first; evidently; along; yet; sometimes pronouns: i; it; he; my; me; his; her; you; him; she; they; them; their; myself; its; we; us; herself; your; one; mantelshelf; itself; himself; ''s proper nouns: tom; kaptein; bush; quatermain; hut; sikukuni; march; heavens; africa; middelburg; kraal; kloof; impala; good; fro; captain; allan; _; zululand; zulu; zanzibar; yorkshire; wife; whereabouts; veldt; trek; tana; tambouki; t''chaka; swazi; south; sir; sequati; road; river; rider; reedy; prophesy; project; oliphant; odds; nullah; northumberland; mimosa; macumazahn; livingstone; lamb; koos; knobnoses; interior keywords: tom; lion; like; kaptein one topic; one dimension: got file(s): ./cache/1918.txt titles(s): Long Odds three topics; one dimension: got; zululand; zululand file(s): ./cache/1918.txt, ./cache/1918.txt, ./cache/1918.txt titles(s): Long Odds | Long Odds | Long Odds five topics; three dimensions: got lion like; zululand grey haggard; zululand grey haggard; zululand grey haggard; zululand grey haggard file(s): ./cache/1918.txt, ./cache/1918.txt, ./cache/1918.txt, ./cache/1918.txt, ./cache/1918.txt titles(s): Long Odds | Long Odds | Long Odds | Long Odds | Long Odds Type: gutenberg title: subject-huntingGuides-gutenberg date: 2021-06-06 time: 17:06 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: facet_subject:"Hunting guides" ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 1918 author: Haggard, H. Rider (Henry Rider) title: Long Odds date: words: 6509 sentences: 315 pages: flesch: 86 cache: ./cache/1918.txt txt: ./txt/1918.txt summary: interior, and so I started with a waggon-load of goods, and came and round granite koppies starting up here and there, looking out like fool I got down off the waggon-box to have a look round, thinking it up I heard the lion behind me, and next second I felt the brute, ay, as lion I ever saw, and I have seen a great many, and he had a most "The lions came back no more that night, and by the next morning my Accordingly Tom took some matches and began starting little fires to like a fan, whereupon I went round to the further side of the pan to the lion, like the lamb of prophesy, but I suppose the reeds were thick, got my gun well on to the lion''s shoulder--the black-maned one--so as to less just in time to see the tail of the last lion vanishing round the ==== make-pages.sh questions Traceback (most recent call last): File "/data-disk/reader-compute/reader-classic/bin/tsv2htm-questions.py", line 23, in df = pd.read_csv( tsv, sep='\t' ) File "/data-disk/python/lib/python3.8/site-packages/pandas/io/parsers.py", line 676, in parser_f return _read(filepath_or_buffer, kwds) File "/data-disk/python/lib/python3.8/site-packages/pandas/io/parsers.py", line 448, in _read parser = TextFileReader(fp_or_buf, **kwds) File "/data-disk/python/lib/python3.8/site-packages/pandas/io/parsers.py", line 880, in __init__ self._make_engine(self.engine) File "/data-disk/python/lib/python3.8/site-packages/pandas/io/parsers.py", line 1114, in _make_engine self._engine = CParserWrapper(self.f, **self.options) File "/data-disk/python/lib/python3.8/site-packages/pandas/io/parsers.py", line 1891, in __init__ self._reader = parsers.TextReader(src, **kwds) File "pandas/_libs/parsers.pyx", line 529, in pandas._libs.parsers.TextReader.__cinit__ File "pandas/_libs/parsers.pyx", line 720, in pandas._libs.parsers.TextReader._get_header File "pandas/_libs/parsers.pyx", line 916, in pandas._libs.parsers.TextReader._tokenize_rows File "pandas/_libs/parsers.pyx", line 2071, in pandas._libs.parsers.raise_parser_error pandas.errors.ParserError: Error tokenizing data. C error: EOF inside string starting at row 1 ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel