mv: ‘./input-file.zip’ and ‘./input-file.zip’ are the same file
Creating study carrel named subject-esperanto-gutenberg
Initializing database
Unzipping
Archive: input-file.zip
creating: ./tmp/input/input-file/
inflating: ./tmp/input/input-file/16432.txt
inflating: ./tmp/input/input-file/8177.txt
inflating: ./tmp/input/input-file/metadata.csv
caution: excluded filename not matched: *MACOSX*
=== DIRECTORIES: ./tmp/input
=== DIRECTORY: ./tmp/input/input-file
=== metadata file: ./tmp/input/input-file/metadata.csv
=== found metadata file
=== updating bibliographic database
Building study carrel named subject-esperanto-gutenberg
FILE: cache/16432.txt
OUTPUT: txt/16432.txt
FILE: cache/8177.txt
OUTPUT: txt/8177.txt
16432 txt/../pos/16432.pos
16432 txt/../wrd/16432.wrd
16432 txt/../ent/16432.ent
=== file2bib.sh ===
id: 16432
author: Bartholdt, Richard
title: Esperanto: Hearings before the Committee on Education
date:
pages:
extension: .txt
txt: ./txt/16432.txt
cache: ./cache/16432.txt
Content-Encoding UTF-8
Content-Type text/plain; charset=UTF-8
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 3
resourceName b'16432.txt'
8177 txt/../pos/8177.pos
8177 txt/../wrd/8177.wrd
8177 txt/../ent/8177.ent
=== file2bib.sh ===
id: 8177
author: Fryer, Helen
title: The Esperanto Teacher: A Simple Course for Non-Grammarians
date:
pages:
extension: .txt
txt: ./txt/8177.txt
cache: ./cache/8177.txt
Content-Encoding ISO-8859-1
Content-Type text/plain; charset=ISO-8859-1
X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.csv.TextAndCSVParser']
X-TIKA:content_handler ToTextContentHandler
X-TIKA:embedded_depth 0
X-TIKA:parse_time_millis 4
resourceName b'8177.txt'
Done mapping.
Reducing subject-esperanto-gutenberg
=== reduce.pl bib ===
id = 8177
author = Fryer, Helen
title = The Esperanto Teacher: A Simple Course for Non-Grammarians
date =
pages =
extension = .txt
mime = text/plain
words = 45578
sentences = 10106
flesch = 86
summary = g hard, as in Go, GiG, Gun. gx soft, as in Gentle, Gem, or like J in Just, Jew. h well breathed, as in Horse, Home, How. hx strongly breathed, and in the throat, as in the Scotch word loCH. the present time, as "Mi vidas", I see; "Sxi estas", She is; "Ili La edzino de mia patro estas mia patrino, kaj la avino de miaj infanoj. cxi arbeto; sed la vento forblovis de mia kapo la cxapon, kaj gxi, Cxie estas floroj, kaj nenie oni povas trovi pli belajn. (These words are called PREPOSITIONS, which means placed before). La birdo flugas en la cxambro ( = gxi estas en la cxambro, kaj flugas en Mi staras ekster la domo, kaj li estas interne. plumujo, kaj bastoneto, sur kiu oni tenas plumon por skribadi, estas Li estas tre purema, kaj ecx unu Ankoraux salton, kaj nun gxi estas cxe la lerneja
cache = ./cache/8177.txt
txt = ./txt/8177.txt
=== reduce.pl bib ===
id = 16432
author = Bartholdt, Richard
title = Esperanto: Hearings before the Committee on Education
date =
pages =
extension = .txt
mime = text/plain
words = 11145
sentences = 686
flesch = 73
summary = advisability of introducing Esperanto as an auxiliary language in the understand Esperanto, and that whenever they would use that language all so on, so that the international language would really be Esperanto. learn, because it is composed of words taken from the English language, greatest living experts in that language and a missionary for Esperanto.
Esperanto, and no matter what language he speaks at home I will get a seven years ago, and since that time Esperanto has made very great Esperanto, then I might be able to teach you the other language in the of the vocabulary in Zamenhof's new language--take words that everybody from each Esperanto word you can form mechanically absolutely every meaning of the original Esperanto word. all, the Esperanto language does not contain any words at all; I think other languages, but it is not true of Esperanto. And here Esperanto has over all other languages not only
cache = ./cache/16432.txt
txt = ./txt/16432.txt
Building ./etc/reader.txt
8177
16432
8177
16432
number of items: 2
sum of words: 56,723
average size in words: 28,361
average readability score: 79
nouns: language; words; time; word; day; place; father; languages; christen; thing; something; years; people; estas; subject; sin; house; man; action; one; way; book; state; children; use; brother; world; preposition; noun; town; sense; vocabulary; verb; person; meaning; cxi; sentence; number; kind; domo; cxar; table; room; reason; child; year; root; order; names; life
verbs: is; be; are; have; was; do; has; used; estas; means; being; say; been; take; were; going; am; see; had; make; know; go; use; come; having; did; tell; give; made; wish; learn; seen; held; denotes; understand; does; added; want; speak; let; become; read; came; find; done; write; went; think; put; called
adjectives: other; many; great; more; same; own; good; such; whole; little; international; antaux; estis; beautiful; different; new; few; first; most; able; simple; present; possible; english; common; various; last; certain; past; much; ankoraux; young; tute; large; true; important; white; several; only; necessary; least; red; long; german; french; difficult; definite; clear; worthy; special
adverbs: not; very; so; only; here; now; then; more; also; already; up; out; instead; often; even; away; always; ago; therefore; on; never; most; long; well;
all; as; again; yet; that; quite; at; too; together; just; is; back; there; less; far; ever; entirely; down; first; still; nearly; however; exactly; before; alone; perhaps
pronouns: i; it; he; you; they; we; his; my; me; she; him; her; their; them; your; its; our; himself; us; one; itself; myself; themselves; herself; yours; iu; yourself; ni; oneself; theirs; ours; ne; u; tri; terurigxis; s; ourselves; mine; je; ii; ido; ia; hx; fajfi
proper nouns: la; li; kaj; estas; mi; esperanto; al; de; ne; vi; |; ke; sxi; ili; ni; gxi; sur; kiu; kun; el; lesson; da; sed; mia; pli; cxi; aux; tre; oni; unu; patro; estis; kiel; vocabulary; por; english; en; mr.; prof.; lin; se; kiam; povas; pri; jen; sia; ol; john; je; lia
keywords: word; esperanto; english; zamenhof; vocabulary; unu; tre; towner; tiu; time; sxi; sur; state; sin; sia; sed; preposition; post; por; pli; place; patro; oni; old; mia; mean; lin; lesson; language; kun; kiu; kaj; john; johano; jen; ili; gxi; george; france; footnote; father; estas; dr.; day; cxi; christen; aux; antaux
one topic; one dimension: la
file(s): ./cache/16432.txt
titles(s): Esperanto: Hearings before the Committee on Education
three topics; one dimension: la; esperanto; universe
file(s): ./cache/8177.txt, ./cache/16432.txt, ./cache/16432.txt
titles(s): The Esperanto Teacher: A Simple Course for Non-Grammarians | Esperanto: Hearings before the Committee on Education | Esperanto: Hearings before the Committee on Education
five topics; three dimensions: la estas mi; esperanto language words; indicate hundredth fatal; indicate hundredth fatal; indicate hundredth fatal
file(s): ./cache/8177.txt, ./cache/16432.txt, ./cache/16432.txt, ./cache/16432.txt, ./cache/16432.txt
titles(s): The Esperanto Teacher: A Simple Course for Non-Grammarians | Esperanto: Hearings before the Committee on Education | Esperanto: Hearings before the Committee on Education | Esperanto: Hearings before the Committee on Education | Esperanto: Hearings before the Committee on
Education
Type: gutenberg
title: subject-esperanto-gutenberg
date: 2021-06-06
time: 14:06
username: emorgan
patron: Eric Morgan
email: emorgan@nd.edu
input: facet_subject:"Esperanto"
==== make-pages.sh htm files
==== make-pages.sh complex files
==== make-pages.sh named enities
==== making bibliographics
id: 16432
author: Bartholdt, Richard
title: Esperanto: Hearings before the Committee on Education
date:
words: 11145
sentences: 686
pages:
flesch: 73
cache: ./cache/16432.txt
txt: ./txt/16432.txt
summary: advisability of introducing Esperanto as an auxiliary language in the understand Esperanto, and that whenever they would use that language all so on, so that the international language would really be Esperanto. learn, because it is composed of words taken from the English language, greatest living experts in that language and a missionary for Esperanto. Esperanto, and no matter what language he speaks at home I will get a seven years ago, and since that time Esperanto has made very great Esperanto, then I might be able to teach you the other language in the of the vocabulary in Zamenhof's new language--take words that everybody from each Esperanto word you can form mechanically absolutely every meaning of the original Esperanto word. all, the Esperanto language does not contain any words at all; I think other languages, but it is not true of Esperanto. And here Esperanto has over all other languages not only
id: 8177
author: Fryer, Helen
title: The Esperanto Teacher: A Simple Course for Non-Grammarians
date:
words: 45578
sentences: 10106
pages:
flesch: 86
cache: ./cache/8177.txt
txt: ./txt/8177.txt
summary: g hard, as in Go, GiG, Gun. gx soft, as in Gentle, Gem, or like J in Just, Jew. h well breathed, as in Horse, Home, How. hx strongly breathed, and in the throat, as in the Scotch word loCH. the present time, as "Mi vidas", I see; "Sxi estas", She is; "Ili La edzino de mia patro estas mia patrino, kaj la avino de miaj infanoj.
cxi arbeto; sed la vento forblovis de mia kapo la cxapon, kaj gxi, Cxie estas floroj, kaj nenie oni povas trovi pli belajn. (These words are called PREPOSITIONS, which means placed before). La birdo flugas en la cxambro ( = gxi estas en la cxambro, kaj flugas en Mi staras ekster la domo, kaj li estas interne. plumujo, kaj bastoneto, sur kiu oni tenas plumon por skribadi, estas Li estas tre purema, kaj ecx unu Ankoraux salton, kaj nun gxi estas cxe la lerneja
==== make-pages.sh questions
==== make-pages.sh search
==== make-pages.sh topic modeling corpus
Zipping study carrel