Creating study carrel named bahnemann-transforming-2021 Initializing database Building cache Building study carrel named bahnemann-transforming-2021 FILE: cache/0yLgw3HoKQ.pdf OUTPUT: txt/0yLgw3HoKQ.txt 0yLgw3HoKQ txt/../pos/0yLgw3HoKQ.pos 0yLgw3HoKQ txt/../wrd/0yLgw3HoKQ.wrd 0yLgw3HoKQ txt/../ent/0yLgw3HoKQ.ent === file2bib.sh === id: 0yLgw3HoKQ author: Greta Bahnemann title: Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project date: 2021-01-20 pages: 76 extension: .pdf txt: ./txt/0yLgw3HoKQ.txt cache: ./cache/0yLgw3HoKQ.pdf Author ['Greta Bahnemann', 'Michael Carroll', 'Paul Clough', 'Mario Einaudi', 'Chatham Ewing', 'Jeff Mixter', 'Jason Roy', 'Holly Tomren', 'Bruce Washburn', 'Elliot Williams.'] Content-Type application/pdf Creation-Date 2021-01-19T04:57:11Z Last-Modified 2021-01-20T21:12:53Z Last-Save-Date 2021-01-20T21:12:53Z X-Parsed-By ['org.apache.tika.parser.DefaultParser', 'org.apache.tika.parser.pdf.PDFParser'] X-TIKA:content_handler ToTextContentHandler X-TIKA:embedded_depth 0 X-TIKA:parse_time_millis 196 access_permission:assemble_document true access_permission:can_modify true access_permission:can_print true access_permission:can_print_degraded true access_permission:extract_content true access_permission:extract_for_accessibility true access_permission:fill_in_form true access_permission:modify_annotations true created 2021-01-19T04:57:11Z creator ['Greta Bahnemann', 'Michael Carroll', 'Paul Clough', 'Mario Einaudi', 'Chatham Ewing', 'Jeff Mixter', 'Jason Roy', 'Holly Tomren', 'Bruce Washburn', 'Elliot Williams.'] date 2021-01-20T21:12:53Z dc:creator ['Greta Bahnemann', 'Michael Carroll', 'Paul Clough', 'Mario Einaudi', 'Chatham Ewing', 'Jeff Mixter', 'Jason Roy', 'Holly Tomren', 'Bruce Washburn', 'Elliot Williams.'] dc:format application/pdf; version=1.6 dc:language en-US dc:title Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project dcterms:created 2021-01-19T04:57:11Z dcterms:modified 2021-01-20T21:12:53Z language en-US meta:author ['Greta Bahnemann', 'Michael Carroll', 'Paul Clough', 'Mario Einaudi', 'Chatham Ewing', 'Jeff Mixter', 'Jason Roy', 'Holly Tomren', 'Bruce Washburn', 'Elliot Williams.'] meta:creation-date 2021-01-19T04:57:11Z meta:save-date 2021-01-20T21:12:53Z modified 2021-01-20T21:12:53Z pdf:PDFVersion 1.6 pdf:charsPerPage ['131', '0', '518', '1554', '1927', '2499', '1567', '3336', '1365', '1225', '1979', '2345', '2880', '3532', '2420', '741', '3124', '1376', '1376', '2887', '1151', '1245', '1186', '807', '789', '1455', '3574', '3807', '1864', '1615', '3368', '1895', '220', '1900', '1727', '2125', '1368', '2272', '1452', '344', '2735', '782', '1052', '795', '3459', '2132', '297', '1925', '2833', '165', '1141', '809', '459', '729', '1073', '785', '1630', '3677', '3353', '4216', '3524', '2901', '3294', '2953', '2413', '3193', '2608', '1443', '2156', '2266', '2627', '2619', '2978', '2659', '2243', '319'] pdf:docinfo:created 2021-01-19T04:57:11Z pdf:docinfo:creator Greta Bahnemann pdf:docinfo:creator_tool Adobe InDesign 16.0 (Windows) pdf:docinfo:modified 2021-01-20T21:12:53Z pdf:docinfo:producer Adobe PDF Library 15.0 pdf:docinfo:title Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project pdf:docinfo:trapped False pdf:encrypted false pdf:hasMarkedContent true pdf:hasXFA false pdf:hasXMP true pdf:unmappedUnicodeCharsPerPage ['0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0', '0'] producer Adobe PDF Library 15.0 resourceName b'0yLgw3HoKQ.pdf' title Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project trapped False xmp:CreatorTool Adobe InDesign 16.0 (Windows) xmpMM:DerivedFrom:DocumentID xmp.did:438f0a48-a7ac-954e-bc07-d997c00854c4 xmpMM:DerivedFrom:InstanceID xmp.iid:8007630b-59e2-804a-92b1-0e9d435ab75a xmpMM:DocumentID xmp.id:50b9ee5d-c1d4-ee4c-89aa-53fd2b9c434f xmpTPg:NPages 76 Done mapping. Reducing bahnemann-transforming-2021 === reduce.pl bib === id = 0yLgw3HoKQ author = Greta Bahnemann title = Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project date = 2021-01-20 pages = 76 extension = .pdf mime = application/pdf words = 21338 sentences = 1845 flesch = 44 summary = Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project The OCLC CONTENTdm Linked Data Pilot project team consisted of the following OCLC staff: In the CONTENTdm Linked Data Pilot project, OCLC partnered testing new applications built in the Wikibase environment for data retrieval, image annotation, This report describes the course of the CONTENTdm Linked Data Pilot project and its primary CONTENTdm Linked Data Pilot project used the Wikibase environment, which includes several OCLC staff exported CONTENTdm metadata for each suggested collection and created an entity a project for each collection in the program OpenRefine25 (figure 11), which provides tools for data CONTENTdm collection metadata in an OpenRefine project.26 View a larger image online. the Wikibase, OCLC developed a CONTENTdm customization that embeds the Schema.org data https://www.oclc.org/en/events/2020/devconnect-online-2020/devconnect-2020-creating-linked-descriptive-data-for-contentdm.html https://www.oclc.org/en/events/2020/devconnect-online-2020/devconnect-2020-creating-linked-descriptive-data-for-contentdm.html https://researchworks.oclc.org/cdmld/screenshots/google-structured-data-testing-tool.png. https://researchworks.oclc.org/cdmld/screenshots/google-structured-data-testing-tool.png. Transforming Metadata into Linked Data to Improve Digital Collection Discoverability 73 cache = ./cache/0yLgw3HoKQ.pdf txt = ./txt/0yLgw3HoKQ.txt Building ./etc/reader.txt 0yLgw3HoKQ 0yLgw3HoKQ number of items: 1 sum of words: 21,338 average size in words: 21,338 average readability score: 44 nouns: data; project; metadata; entity; entities; image; figure; oclc; collections; pilot; work; user; model; interface; description; tools; staff; system; information; collection; process; discoverability; reconciliation; example; descriptions; participants; discovery; search; view; part; headings; environment; type; materials; field; application; tool; team; relationships; images; workflows; library; results; property; partners; connections; concept; statements; person; systems verbs: linked; be; was; is; improve; are; were; used; using; transforming; based; developed; see; has; created; associated; have; provided; help; •; described; been; updated; found; adding; use; including; included; had; find; create; provide; describing; defined; shared; related; transformed; managing; make; depicted; creating; cataloging; oriented; needed; describe; testing; look; illustrated; evaluated; displayed adjectives: new; other; larger; related; more; digital; different; descriptive; local; explorer; creative; contextual; subject; initial; cultural; separate; first; such; specific; single; same; important; additional; several; many; depicted; useful; potential; external; current; unique; simple; significant; shared; richer; retriever; large; future; able; traditional; reconcile; public; ontological; most; manual; individual; great; essential; effective; unmapped adverbs: also; not; more; online; out; up; only; as; well; most; quickly; very; then; better; rather; locally; frequently; yet; especially; so; previously; particularly; home; effectively; back; ultimately; together; strongly; greatly; even; enough; easily; best; already; perhaps; n’t; much; initially; in; immediately; consistently; below; automatically; truly; thereby; thematically; that; still; sometimes; significantly pronouns: it; our; we; their; its; they; us; you; them; your; itself; one; i; https://www.oclc.org/en/contentdm.html; https://www.mediawiki.org/wiki/mediawiki; https://merrick.library.miami.edu/cubanheritage/chc0468/; https://github.com/wetneb/openrefine-wikibase; her proper nouns: wikibase; contentdm; data; digital; collection; metadata; figure; library; view; annotator; image; oclc; discoverability; university; linked; openrefine; transforming; explorer; wikidata; field; cleveland; transportation; libraries; json; analyzer; public; minnesota; mediawiki; temple; hub; retriever; project; schema.org; pilot; describer; •; miami; ld; iiif; rdf; huntington; dogs; º; user; sparql; wikimedia; viaf; phase; museum; january keywords: wikibase; oclc; metadata; link; library; figure; digital; data; collection one topic; one dimension: data file(s): ./cache/0yLgw3HoKQ.pdf titles(s): Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project three topics; one dimension: data; zoom; zoom file(s): ./cache/0yLgw3HoKQ.pdf, ./cache/0yLgw3HoKQ.pdf, ./cache/0yLgw3HoKQ.pdf titles(s): Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project | Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project | Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project five topics; three dimensions: data https org; zoom idea hopes; zoom idea hopes; zoom idea hopes; zoom idea hopes file(s): ./cache/0yLgw3HoKQ.pdf, ./cache/0yLgw3HoKQ.pdf, ./cache/0yLgw3HoKQ.pdf, ./cache/0yLgw3HoKQ.pdf, ./cache/0yLgw3HoKQ.pdf titles(s): Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project | Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project | Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project | Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project | Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project Type: file2carrel title: bahnemann-transforming-2021 date: 2021-01-21 time: 16:07 username: emorgan patron: Eric Morgan email: emorgan@nd.edu input: 0yLgw3HoKQ.pdf ==== make-pages.sh htm files ==== make-pages.sh complex files ==== make-pages.sh named enities ==== making bibliographics id: 0yLgw3HoKQ author: Greta Bahnemann title: Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project date: 2021-01-20 words: 21338 sentences: 1845 pages: 76 flesch: 44 cache: ./cache/0yLgw3HoKQ.pdf txt: ./txt/0yLgw3HoKQ.txt summary: Transforming Metadata into Linked Data to Improve Digital Collection Discoverability: A CONTENTdm Pilot Project The OCLC CONTENTdm Linked Data Pilot project team consisted of the following OCLC staff: In the CONTENTdm Linked Data Pilot project, OCLC partnered testing new applications built in the Wikibase environment for data retrieval, image annotation, This report describes the course of the CONTENTdm Linked Data Pilot project and its primary CONTENTdm Linked Data Pilot project used the Wikibase environment, which includes several OCLC staff exported CONTENTdm metadata for each suggested collection and created an entity a project for each collection in the program OpenRefine25 (figure 11), which provides tools for data CONTENTdm collection metadata in an OpenRefine project.26 View a larger image online. the Wikibase, OCLC developed a CONTENTdm customization that embeds the Schema.org data https://www.oclc.org/en/events/2020/devconnect-online-2020/devconnect-2020-creating-linked-descriptive-data-for-contentdm.html https://www.oclc.org/en/events/2020/devconnect-online-2020/devconnect-2020-creating-linked-descriptive-data-for-contentdm.html https://researchworks.oclc.org/cdmld/screenshots/google-structured-data-testing-tool.png. https://researchworks.oclc.org/cdmld/screenshots/google-structured-data-testing-tool.png. Transforming Metadata into Linked Data to Improve Digital Collection Discoverability 73 ==== make-pages.sh questions ==== make-pages.sh search ==== make-pages.sh topic modeling corpus Zipping study carrel