quadgram

This is a table of type quadgram and their frequencies. Use it to search & browse the list to learn more about your study carrel.

quadgram frequency
td td td td476
span a href http414
li li a href316
li a href http300
cmd wordsearch phrase zzzz298
zzzz word zzzz bookcode298
phrase zzzz word zzzz298
wordsearch phrase zzzz word298
life of a librarian224
li b source b217
li li b source217
when it comes to215
on the other hand212
td tr tr td208
word zzzz bookcode etext199
bookcode etext segsize usepre199
etext segsize usepre word199
zzzz bookcode etext segsize199
td img src http197
li b keywords b196
the a href http190
university of notre dame167
in the form of166
ul li b keywords165
through the use of155
li li b date154
tr align center td149
at the same time143
was never formally published141
a li li a139
align center td img134
at the university of134
as well as the133
center td img src132
the university of notre128
br span a href119
this text was never108
a a href http104
p a href http103
p img src http103
word zzzz bookcode thoreau99
span td tr table98
table align right tr98
td tr table p98
cmd term id themes97
a span td tr96
with the advent of93
it is possible to92
br a href http89
the number of times88
as well as a87
text was never formally87
p table align right84
for the purposes of84
align right tr align82
right tr align center82
called a href http82
a part of the80
cmd term id formats78
catalogue of electronic texts78
li a href https77
li b facet terms77
local annotated a li77
annotated a li ul77
ul li b creator77
li li b facet77
pdf local annotated a77
li b rights b77
b date read b77
li b date created77
li b date read77
li li b versions77
li li b rights77
b facet terms b77
b date created b77
quite a number of72
alex catalogue of electronic72
jpg br span a71
id p table align71
it a span td71
td td align right71
getwater id p table71
map it a span71
the use of the71
i li li strong71
cmd getwater id p71
p p a href69
can be used to68
my experiences at the67
the purpose of the67
p p img src67
open li li b66
td td align center66
table align center tr65
on the topic of65
td td img src65
the problem of find64
as a part of63
p table align center63
may or may not62
is a list of61
the use of a60
the result will be60
next generation library catalogs58
the library of congress58
of open source software57
the form of a57
td align right td57
of a href http56
will be able to56
digital humanities computing techniques55
in an effort to55
number of words in54
available on the web54
you may or may53
png img src http53
a list of all53
the content of the53
div id attachment class53
com sandbox liam sparql53
id attachment class wp53
width height class size52
ul li a href52
align center img src51
center img src http51
of words in a50
of the great books50
td align center img50
or may not know49
is a type of49
had the opportunity to48
the most frequently used48
frequency inverse document frequency48
jpg img src http48
enabling the reader to47
a part of a47
of the western world46
the result ought to46
as you may or46
great books of the46
list of all the46
of a number of46
books of the western46
p ul li strong45
for the most part45
for a good time45
each item in the45
conference on digital libraries45
gov authorities names n45
this is a pre45
and at the same44
are a number of44
center tr align center44
p ul li a44
align center tr align44
more things in common43
the distant reader is43
alt width height class43
here at notre dame43
a target blank href43
enable the reader to43
the number of words42
term frequency inverse document42
txt plain text a42
with a number of41
words in a document41
open source software and41
result ought to be40
things in common than39
the whole of the39
p div id attachment39
digital public library of39
library association annual meeting39
public library of america39
result will be a38
but not limited to38
documents my experiences at38
of the great ideas38
european conference on digital38
and a href http38
at the very least37
is intended to be37
i had the opportunity37
target blank href http37
there are a number36
apache tika release apache36
tika release apache tika36
library collections and services36
total number of documents36
are the great books36
i was able to36
in the united states36
the hathitrust research center36
this article was originally36
in common than differences35
the full text of35
a p id caption35
it comes to the35
the download page for35
create a list of35
download page for more35
com sandbox liam id35
not intended to be34
a number of things34
all of these things34
the great ideas coefficient34
contains a set of34
is a set of34
height class aligncenter size34
open source software in34
width height class aligncenter34
make it easier to34
open source software is33
allowing the reader to33
caption aligncenter a href33
is one of the33
html main sandbox liam33
www html main sandbox33
the alex catalogue of33
to read and write33
disk www html main33
an overview of the33
and open source software33
the next step is33
aligncenter a href http33
can be applied to33
next step is to32
a type of http32
ought to be a32
are intended to be32
in the same breath32
intended to be read32
as opposed to the32
the person next to32
in a number of32
the end of the32
to the use of32
center for digital scholarship32
total number of words32
as if it were32
have more things in32
a plain text file31
a look at the31
linked data is a31
a number of ways31
catholic research resources alliance31
source software in libraries31
in the public domain31
i am able to31
li li b keywords31
for each of the31
the american library association30
li ul p the30
a span violent span30
see the list of30
the problem of use30
align right src http30
will need to be30
the great books of30
i collected this water30
the items in the30
this text documents my30
the world wide web30
to be able to30
was originally published in30
of plain text files30
is an acronym for29
center for research computing29
if you want to29
old is new again29
you will need to29
each of the great29
for more information on29
span man is span29
is old is new29
the code lib community29
align right td td29
in the triple store28
align right td tr28
catholic youth literature project28
formats web articles a28
number of documents in28
person next to you28
plot on a timeline28
the frequency of words28
described in this proposal28
the good work of28
id formats web articles28
jpg alt width height28
of the semantic web28
the use of computers28
number of times the28
is a part of28
most frequently used words28
the balance of the28
enables the reader to28
img width src http28
term id formats web28
a chttp a f28
have been able to27
speech and named entities27
north carolina state university27
of these things are27
article was originally published27
intended to be used27
libraries of notre dame27
page for more information27
img align right src27
the names of people27
a greater number of27
in a href http27
text documents my experiences27
have a look at27
p blockquote p code27
text mining is a27
p img align right27
this is a list27
plot on a map26
more information on how26
the other end of26
putting it on the26
more like this one26
look at the download26
are some of the26
are not limited to26
the idea of the26
the university of michigan26
top tech trends for26
linked data is not26
wiki declaration of independence26
is expected to be26
are expected to be26
information on how to26
org wiki declaration of26
height br a href26
and the number of26
it on the web26
the result is a26
of full text content26
a span td td26
at the download page26
h summary h p26
width height br a26
but at the same25
at the other end25
here are a few25
next generation library catalog25
list of changes in25
plain text versions of25
a td tr table25
rdf and linked data25
libraries at the university25
full list of changes25
my a href http25
as a set of25
the advent of the25
on how to obtain25
please see the changes25
right td tr tr25
how to obtain apache25
the complete works of25
of documents in a25
the file named data25
to obtain apache tika25
and natural language processing25
in order to be25
american library association annual24
is not so much24
a number of years24
provide access to the24
open community knowledge hypermedia24
all of this is24
computer programs and scripts24
the value of the24
find more like this24
this presentation was given24
knowledge hypermedia administration and24
be used as a24
denoting the number of24
hypermedia administration and metadata24
code p blockquote p24
the lita blog at24
number of times each24
figure out how to24
the whole thing is24
lita blog at http24
and have a look24
documents in a corpus24
in the life of24
to learn how to24
eric lease morgan eric24
and the digital humanities24
a digital library framework24
community knowledge hypermedia administration24
frequency li li a24
make it easier for23
morgan eric morgan infomotions23
from the internet archive23
this is a good23
the north carolina state23
is made up of23
what is old is23
hesburgh libraries at the23
a perl module called23
lease morgan eric morgan23
of the digital humanities23
in the right direction23
is a good thing23
code pre blockquote p23
to answer the question23
the hesburgh libraries at23
there are a few23
is akin to a23
p blockquote pre code23
to accomplish this goal23
it is important to23
problem to be solved22
to make a book22
to what degree is22
the content of a22
the ala annual meeting22
the opportunity to visit22
with a set of22
and why should i22
this subdirectory contains a22
the use of these22
as but not limited22
the catholic research resources22
of globally networked computers22
p ol li strong22
tr tr align center22
word in a text22
the creation of a22
of the things i22
id formats journal articles22
how to use the22
p blockquote p the22
to figure out how22
are not intended to22
the context of the22
of publishing linked data22
themes libraries and librarianship22
whether or not the22
the total number of22
a violent fit of22
the purpose of this22
introduction to the nltk22
and or their frequency22
of the hesburgh libraries22
why should i care22
some of the things22
of one or more22
in the end i22
formats journal articles a22
but are not limited22
include but are not22
not seem to be22
such as but not22
eric lease morgan lt22
cite a href http22
subdirectory contains a set22
term id formats journal22
word cloud illustrating the22
with the exception of22
the national library of22
available as linked data21
of the human condition21
themes data curation a21
this was originally a21
p this is a21
is a collection of21
document was never formally21
in the first place21
li go to step21
a few years ago21
in this repository all21
analytics cookies to understand21
this repository all github21
to understand how you21
this document was never21
edited edited copy for21
cookies to understand how21
id themes data curation21
this travel log documents21
a and a href21
copy for eric lease21
understand how you use21
term id themes data21
articulating a research question21
a presentation at the21
a number of other21
this essay was originally21
td a href http21
from a set of21
edited copy for eric21
for eric lease morgan21
of the a href21
td tr tr align21
what it means to21
the catholic pamphlets project21
li li go to21
university libraries of notre21
p h summary h21
in any number of20
a td td img20
and putting it on20
is no such thing20
p h a id20
this is where the20
the use of rdf20
to what degree do20
plain text version of20
a great deal of20
it seems as if20
from the command line20
water and putting it20
into a coherent whole20
locations in a text20
is not the problem20
the contents of the20
is an introduction to20
a number of us20
advocate the use of20
for feature in features20
words phrases in a20
a plain text version20
a growing number of20
founding date ad http20
we do not advocate20
cloud illustrating the most20
in a text services20
the bulk of the20
p this is the20
the idea of a20
my point of view20
at least a couple20
the services against texts20
release and have a20
from my point of20
the work of the20
in the file named20
frequently used words in20
on the good work20
on a web server20
org oclc worldcat a20
has founding date ad20
in the case of20
it is time to20
through the creation of20
is probably the most20
number of documents containing20
and locations in a20
very similar to the20
take the form of20
li ul li li20
there is no such20
here at the university20
a word cloud illustrating20
at a href http20
back a list of20
a long time ago20
have a number of19
to a href http19
used to denote the19
code li li find19
how to make a19
tech trends for ala19
an analysis of the19
step in the right19
a number of different19
the name of a19
a code li li19
archival descriptions as linked19
last of the mohicans19
tools described in this19
in a presentation called19
in order to make19
be a list of19
return a list of19
descriptions as linked data19
p p in the19
open source software for19
freely available on the19
this posting describes how19
one of the more19
in the same way19
of the top ten19
a wide variety of19
you will find a19
more than ten years19
answer questions such as19
online public access catalogs19
on the lita blog19
intended to be a19
code lib mailing list19
i gave a presentation19
carolina state university libraries19
content uploads c l19
the code lib mailing19
the reader to do19
an integrated library system19
p blockquote p a19
a set of perl19
would not have been19
early english books online19
from a number of19
is located in the18
a better understanding of18
to what degree are18
be applied to the18
a a a a18
published in computers in18
the results of text18
is a travel log18
in computers in libraries18
is a good example18
least a couple of18
item in the corpus18
the reader can see18
the collection as a18
as well as some18
make sense of the18
on a world map18
a word of interest18
use of a concordance18
org library virtue htm18
this blog posting describes18
a whole lot of18
get back a list18
not advocate the use18
counting the number of18
as if they were18
it used to be18
the answers to these18
of the items in18
ought to be able18
if it were a18
each record in the18
for the long haul18
can be used in18
the content they find18
can be imported into18
the increasing availability of18
to make sense of18
collection as a whole18
list of a href18
of books and journals18
the university libraries of18
people who work in18
a whole lot like18
the number of documents18
this is the first18
illustrating the most frequent18
to participate in the18
a set of books18
the reader to use18
was not able to18
do not advocate the18
use of the levenshtein18
at the a href18
the problem to be18
text version of a18
what are some of18
copies keep stuff safe18
he went on to17
to take advantage of17
it is more than17
tr td align center17
words in a text17
through the application of17
web and linked data17
worth a thousand words17
in the digital humanities17
they can be used17
text as well as17
ask and answer questions17
a li ul p17
picture is worth a17
the list of a17
mining and natural language17
outlines some of my17
width height src http17
advent of the internet17
is worth a thousand17
this posting documents my17
is one way to17
collecting water and putting17
provide the means for17
it is used to17
concord and merrimack rivers17
version of an article17
university of illinois at17
through a set of17
p p img align17
the google books project17
text mining and natural17
in the process of17
the concord and merrimack17
how they can be17
for the library profession17
to come up with17
on the concord and17
part ii of iii17
a list of urls17
li ol p the17
when it came to17
is a sort of17
it means to be17
close and distant reading17
td tr table h17
semantic web and linked17
by eric lease morgan17
to count and tabulate17
in a given text17
img width height src17
com alex alex catalogue17
provides an overview of17
as illustrated by the17
dpla beta sprint submission17
the scholarly communications process17
the center for research17
after a bit of17
digital library collections and17
this text is a17
a large number of17
different types of input17
td img width src17
target blank img src16
the opportunity to attend16
no such thing as16
a stop word list16
photos infomotions in set16
com images img src16
or just about any16
problem of find amp16
the distant reader can16
to step for each16
corpus li li a16
a report against the16
day configure use constant16
some of my experiences16
main sandbox liam etc16
may need to be16
sanity check my output16
is akin to the16
images img src http16
reader ought to be16
is a lot like16
by the number of16
authorities names n gt16
reader is intended to16
com photos infomotions in16
the heart of the16
to do the work16
after wrestling with wilson16
in this proposal are16
into a search box16
the day configure use16
with the mobile web16
process each record in16
search retrieve url service16
are plain text files16
outlines some of the16
of data and information16
this is not a16
goes a long way16
for a limited period16
the digital public library16
the traditional reading process16
and sanity check my16
for most of the16
the idea of love16
this posting describes a16
configure use constant etc16
the goal of the16
initialize and sanity check16
the second document is16
wilson for most of16
libraries are expected to16
results of text mining16
a good time was16
problem of find get16
number of times a16
a p p the16
border vspace hspace a16
the great books are16
comes to the idea16
to interact with the16
a means to an16
time of the reader16
world wide web servers16
from the university of16
one and only one16
this sort of work16
limited period of time16
wrestling with wilson for16
find is not the16
the author of the16
the principles of librarianship16
of the day configure16
is based on the16
the home page for16
a list of the16
is a tool for16
of the alex catalogue16
updated name and id16
in the hopes of16
p h links h16
most of the day16
org net c dm16
in the realm of16
the semantic web and16
they are intended to16
in the corpus li16
blank img src http16
posting outlines some of16
means to an end16
in the middle of16
some of the more16
is to figure out16
with wilson for most16
library and information science16
including but not limited16
the reader ought to16
the reader wanted to16
border br span a16
a limited period of16
provide services against the16
org library virtue tsv16
to the idea of16
the time of the16
png colors a li16
com sandbox liam tmp16
to a greater degree16
allows the reader to16
the center for digital16
some of them are16
of copies keep stuff16
spread full text indexing16
upon and visualize parts16
an introduction to the16
what words are used16
does not seem to16
subsets of the collection15
search retrieve via url15
of electronic texts a15
span good man span15
this does not mean15
it is relatively easy15
where in the world15
this is the briefest15
originally a blog posting15
blockquote p a href15
length of a book15
into other plain text15
denoting the location of15
set of perl modules15
one or more words15
of a corpus with15
is the briefest of15
not have been able15
publishing archival descriptions as15
called the great books15
i would not have15
chttp a f fsimile15
version of a corpus15
for more information about15
span td td align15
five different types of15
identifying themes and clustering15
of the functionality of15
clustering documents using mallet15
see these sorts of15
the open content alliance15
creating a plain text15
to the a href15
org library virtue figures15
this text was originally15
hspace vspace align right15
in conjunction with the15
seven of the top15
li li click the15
edu sandbox reader hackaton15
good work of others15
to share some of15
getting started with xml15
with the internet archive15
is in the form15
themes and clustering documents15
considering the fact that15
and clustering documents using15
a li ul h15
width br a href15
number of unique words15
used in conjunction with15
in the field of15
a corpus with tika15
creation and maintenance of15
edited version of an15
part i of iii15
as well as an15
of open access journals15
part iii of iii15
was originally a blog15
essay was originally published15
to any number of15
presentation was given at15
eye candy by eric15
documents some of my15
a picture is worth15
is just one example15
not be able to15
was originally given at15
a step in the15
set of books called15
the corpus li li15
and a set of15
written by the author14
set of marc records14
something to do with14
issues pull requests actions14
what degree do these14
was originally published on14
the university of chicago14
be as simple as14
when things were published14
how you use github14
you will be able14
does this have to14
d total number of14
td justice td td14
and a list of14
of text mining and14
the atom publishing protocol14
once and at the14
a data management plan14
hspace a a href14
td rose td td14
imported into for favorite14
it is a process14
github desktop if nothing14
word and phrase frequencies14
the answer lies in14
files that can be14
stanford named entity recognizer14
the size of the14
number of years ago14
desktop if nothing happens14
make it easy to14
great idea tfidf scores14
at the north carolina14
a set of documents14
was had by all14
into for favorite spreadsheet14
party analytics cookies to14
they used to be14
desktop and try again14
collection predicts the future14
the advantages and disadvantages14
along the way i14
changes in this release14
pull requests actions projects14
in a set of14
begin to see how14
services against the index14
is not only about14
determine whether or not14
term id formats magazine14
unable to create model14
vspace hspace a a14
td td td tr14
the files in the14
adventures of huckleberry finn14
download github desktop and14
words in each item14
good time was had14
planet eric lease morgan14
in the current environment14
code issues pull requests14
the result as a14
txt file for a14
a number of library14
as a data structure14
we use optional third14
distant reader is intended14
reload to refresh your14
elaborate upon and visualize14
many of the things14
the size of a14
i participated in a14
id formats magazine articles14
into a number of14
the advent of ubiquitous14
com so we can14
none none none none14
making it easier for14
in a galaxy far14
for a full list14
the body of the14
is left up to14
t total number of14
at once and at14
for better or for14
it is a good14
as the number of14
with a script called14
can build better products14
my water collection predicts14
with a violent fit14
the gnu public license14
of the levenshtein algorithm14
of the more interesting14
in order to get14
launching github desktop if14
get a list of14
there is a need14
definitions of these columns14
the life of a14
text versions of the14
to refresh your session14
be imported into for14
the distant reader and14
of interest to you14
musings on information and14
answers to these questions14
its purpose is to14
png alt width height14
edu f f fontologies14
another tab or window14
for a presentation at14
documenting my experiences at14
this have to do14
indexed true stored true14
a full list of14
publishing linked data is14
that needs to be14
of cultural heritage institutions14
the good folks at14
the type of content14
number of times they14
the declaration of independence14
i believe it is14
the definitions of these14
the totality of the14
to be read by14
a set of marc14
we can build better14
upon a sort of14
at the american library14
td chair td td14
of changes in this14
of rdf and linked14
alex alex catalogue of14
sort the result by14
integer denoting the number14
in the near future14
td word td td14
of services against texts14
file for a full14
time was had by14
the first part of14
the catholic youth literature14
on information and librarianship14
whether or not it14
when compared to the14
have to do with14
a reference to a14
left up to the14
an integer denoting the14
problem to solve is14
this being the case14
reference to a hash14
go to step until14
of books called the14
what does this have14
better or for worse14
imaginative or intellectual content14
galaxy far far away14
of king henry the14
formats magazine articles a14
as it used to14
a number of people14
c number of times14
a list of words14
a galaxy far far14
of a marc record14
f f fontologies fmods14
globally networked computers and14
github desktop and try14
water collection predicts the14
commenced upon a sort14
a presentation to the14
the tools described in14
an rdf triple store14
access to the materials14
p ol li a14
this is the tiniest14
in the linked data14
in cultural heritage institutions14
the middle of the14
become a part of14
advantages and disadvantages of14
items in a corpus14
so we can build14
a number of my14
each of the items13
align center td align13
submission will describe and13
in a single word13
tcp baxter graphs catalog13
with the help of13
related open source software13
requires the skills of13
how it can be13
given a corpus of13
the reader to the13
advent of globally networked13
a next generation library13
copyright and the digital13
the creation and maintenance13
more important than the13
similarities and differences between13
mods a chttp a13
and a number of13
p the a href13
the same time we13
colors a li li13
f fontologies fmods e13
the same time it13
and provide access to13
the advent of globally13
and the distant reader13
at the end of13
the majority of the13
set of plain text13
an a href http13
by henry david thoreau13
h p img src13
linkcode as camp creative13
i am in the13
browser emerson graphs catalog13
was originally written for13
is relatively easy to13
within the context of13
a couple of the13
li li use the13
using a set of13
week on the concord13
it is nice to13
ie utf tag infomotions13
td td a href13
with the use of13
one of the oldest13
not as important as13
will describe and demonstrate13
ol li a href13
is the tiniest of13
a couple of years13
p this posting describes13
might be able to13
tcp love graphs catalog13
com sandbox liam data13
of what it means13
build on the good13
absence from my employer13
the functionality of the13
seems to be an13
query prefix mods a13
on a google map13
td tr table this13
distant reader is a13
tr table p the13
enables a person to13
semantic web in libraries13
items in the collection13
it is not about13
share some of my13
an open source software13
some automated analysis of13
used to create the13
as camp creative creativeasin13
browser thoreau graphs catalog13
you can download the13
books when it comes13
it can be used13
in the world of13
on a regular basis13
it seems to be13
a set of tab13
the output of the13
how to use a13
but it is not13
a week on the13
of eric lease morgan13
directory of open access13
prefix mods a chttp13
may be able to13
could be associated with12
and answer questions of12
a travel log this12
exploit the use of12
count the number of12
the process of publishing12
relationships between subjects and12
this proposal are not12
versions of the pamphlets12
of these files are12
in the etc directory12
a few words into12
address the problem of12
outlined in this proposal12
these sorts of services12
tables of contents and12
declaration of independence gt12
amount of unstructured data12
a travel log documenting12
to be in the12
to do text mining12
triple store eric lease12
query language of relational12
containing the query terms12
the help of a12
need to have been12
to the file system12
open source software development12
a description of how12
this can be done12
imagine being able to12
not suppose to do12
enter a few words12
strong text mining strong12
on one hand there12
read and write marc12
full text of the12
in the world is12
the other way around12
of the works of12
by counting and tabulating12
this was done with12
the form of an12
there were a number12
for each item in12
configure use constant index12
services against texts outlined12
po s file not12
form of a uri12
what degree is the12
it is not only12
distribution comes with a12
to be used as12
here is a list12
what to do with12
describes how to use12
text was never published12
has founding date bc12
speed of four records12
sense of all the12
the local file system12
would be possible to12
only one root element12
thumbnail alt a a12
such a thing is12
into a triple store12
in the body of12
problem of find is12
uses of these files12
it is difficult to12
do a search in12
of linked open data12
metadata as linked data12
search engines of google12
the appearance of the12
average number of words12
in your text editor12
people want to do12
content uploads london img12
code li li code12
i am interested in12
library of congress authority12
eric lease morgan emorgan12
of a given word12
the form of plain12
in a relational database12
allow the reader to12
because libraries are expected12
for the reader to12
was attended by approximately12
ideas behind the semantic12
file contains a set12
store my etc etc12
of a serial nature12
of four records per12
this is a travel12
word to other words12
it does this by12
the value of marc12
list of words and12
of word in a12
number of words per12
originally given at the12
the proximity of a12
provide a means to12
the sru interface to12
few words into a12
marc file in your12
the results of the12
the problem is addressed12
create a word cloud12
range of imaginative or12
the speed of four12
marrying close and distant12
the semantics of xml12
text files that can12
these files are described12
and open access publishing12
store eric lease morgan12
to take full advantage12
the way to go12
mining is a process12
tr valign top td12
an overview of what12
a number of times12
behind the semantic web12
is a whole lot12
and select items of12
the hypertext transfer protocol12
arbitrary amount of unstructured12
blog posting on the12
to do things with12
location of word in12
notice how the word12
that can be imported12
is far from perfect12
of congress authority record12
unable to open store12
published on techessence at12
sp o file not12
who work in libraries12
were not limited to12
at the speed of12
with the linked data12
live in a world12
td tr table i12
and possible uses of12
log documenting my experiences12
td td tr tr12
the number of topics12
the store my etc12
the reader can run12
of words are used12
is not too difficult12
while the problem of12
they make it easier12
as well as from12
li ul h a12
collections as a whole12
and write marc records12
originally published in computers12
save the time of12
each file contains a12
taken with a violent12
i have to do12
such thing as a12
against the triple store12
documents my experience at12
given word to other12
size of library collections12
the size of library12
so p file not12
and plot on a12
l as o a12
travel log documents my12
edited version of a12
makes it easier to12
they allow you to12
type text indexed true12
day in the life12
the root of the12
as much as they12
this process is called12
is the mail going12
books called the great12
documents containing the query12
the most common words12
a file of marc12
world is the mail12
to enable the reader12
all the words in12
was done with a12
a few special characters12
to the distant reader12
form of plain text12
described in the file12
of traditional library principles12
an irish catholic layman12
and microsoft a reality12
of illinois at urbana12
files ought to be12
content uploads ngc lib12
has something to do12
select items of interest12
was taken with a12
and get back a12
information technology and libraries12
a given word to12
plain text files that12
allow you to define12
until you get tired12
this posting outlines some12
this presentation was originally12
open the store my12
this proposal assumes the12
a diverse set of12
possible uses of these12
it is not as12
suppose the reader wanted12
originally published on techessence12
and the frequency of12
of these columns and12
this travel log was12
i went to the12
top td align center12
in a document and12
the value of db12
the hesburgh libraries of12
and the result will12
i get many hits12
uk xslt ead rdf12
the problem to solve12
have the computer programmer12
days in the life12
words in the corpus12
center td align right12
to be manifested in12
it is intended to12
posting on the lita12
than ten years ago12
may be transformed into12
language of relational databases12
the topic of the12
presentation was originally given12
harvested data has been12
one of the most12
p file not found12
appears in a document12
includes a number of12
sort search results by12
div div id attachment12
the results in a12
next to you is12
all of the works12
blockquote p code curl12
of imaginative or intellectual12
align right tr td12
primary purpose was to12
travel log documenting my12
while i was there12
the search engines of12
letters of an irish12
at the present time12
presentation of the day12
of no more than12
width height border alt12
do this sort of12
columns and possible uses12
what sorts of things12
arguments my db argv12
are uniquely positioned to12
a document or corpus12
of the sru interface12
the creation of an12
it would be possible12
this posting documents some12
this analysis can be12
of an irish catholic12
this is because the12
files are described elsewhere12
line arguments my db12
and only one root12
the average length of12
if i get many12
the words in each12
p div div id12
my alex catalogue of12
be transformed into a12
lease morgan emorgan nd12
in a networked environment12
of contents and back12
words into a search12
text indexed true stored12
four records per minute12
these columns and possible12
actions projects security insights12
founding date bc http12
my experience at the12
o a code li12
report against the database12
the following query will12
of each of these12
the reader to see12
mentioned in the text12
authorities at the speed12
just about any other12
on techessence at http12
o file not found12
proximity of a given12
command line arguments my12
take full advantage of12
td colspan document td12
was going to be12
you can see the12
on a set of12
is very important to12
they are plain text12
s file not found12
using a number of12
the world is the12
can also be used12
there is too much12
are described in the12
some sort of database12
f total number of12
in a given document12
up to the reader12
requests actions projects security12
as many of the12
of all the content12
set or get the12
valign top td align12
chatter at code lib12
libraries are uniquely positioned12
on open source software12
a tool for reading12
the result was a12
use of the sru12
of notre dame is12
sets of words are12
a blog posting on12
total number of unique12
of documents containing the12
alt a a href12
process of publishing linked12
in the semantic web12
personal tei publishing system12
it does this in12
such a thing would12
a combination of the12
of perl modules called11
text and data mining11
is the home page11
in conjunction with other11
entities with stanford tools11
themes digital humanities a11
a christmas carol cite11
book describes how to11
id themes digital humanities11
named entities with stanford11
the better part of11
by the end of11
to facilitate searching keywords11
tr th align right11
enumerated a number of11
give it a whirl11
libraries and librarianship a11
of the project was11
all words in the11
fs fp fo d11
have a better understanding11
the goals of librarianship11
i have commenced upon11
with a bit of11
in the th century11
to learn about the11
wrote seven of the11
the full list of11
but in order to11
span td td img11
as a person who11
on the shoulders of11
of henry david thoreau11
p for a good11
shakespeare wrote seven of11
as a means of11
source software for libraries11
than one way to11
ul h a id11
digital humanities and the11
the goals of the11
as well as to11
term id themes digital11
and describes how they11
of changes in the11
the length of a11
outputs sets of structured11
voyant tools to do11
cite a christmas carol11
about open source software11
in no priority order11
the answer is not11
is a matter of11
then it would be11
things can be done11
of my experiences at11
the profession needs to11
of the code lib11
the integrated library system11
graphing with tableau public11
the shoulders of giants11
constant chatter at code11
other end of a11
take better advantage of11
the top ten books11
the other day i11
other plain text files11
metadata and full text11
of the collection is11
other end of the11
software is never done11
a part of this11
a person needs to11
in the release and11
with open source software11
i think it is11
here you will find11
of the archival community11
or their frequency li11
information as well as11
the vast majority of11
the proverbial fire hose11
takes an arbitrary amount11
tr td img width11
p blockquote p i11
would be able to11
of the university of11
where b fs fp11
restricted li li b11
posting documents some of11
more than one way11
american and english literature11
used to describe the11
some of my ideas11
wondering whether or not11
file for the full11
of term frequency inverse11
p table tr td11
searching keywords in context11
to a set of11
have commenced upon a11
in the set was11
a mailing list called11
it is interesting to11
for the full list11
align center tr td11
is a step in11
the creation of your11
term id themes libraries11
s of copies keep11
text mining and the11
ten books when it11
in a previous posting11
does not have to11
will need to have11
to the source code11
can be used as11
some of the possibilities11
a few weeks ago11
to do this work11
the advancement of learning11
is the first of11
in the context of11
b fs fp fo11
the role of the11
and named entities with11
id themes libraries and11
your milage may vary11
be a part of11
as a href http11
in a way that11
to be transformed into11
primary purpose is to11
sets of structured data11
given a set of11
the archival community has11
to the code lib11
top ten books when11
called the a href11
means to be human11
the similarities and differences11
changes in the release11
click a word to11
to rule them all11
their frequency li li11
tiny text mining tools11
for the advancement of11
to answer this question11
are we there yet11
save the result as11
and enable the reader11
facilitate searching keywords in11
of words and associated11
txt file for the11
p p there are11
an arbitrary amount of11
together for the advancement11
program called a href11
about a number of11
at the bottom of11
to be a list11
using voyant tools to11
working together for the11
of interest from the11
a good idea to11
align center tr valign11
of computers in libraries11
the release and have11
the time and effort11
tools to do some11
availability of full text11
needs to be a11
of the library catalog11
thank you for the10
to begin to see10
really intended to be10
the results can be10
given the full text10
p on the other10
were done against the10
outlined a number of10
this is the readme10
numeric characteristics of records10
is almost always a10
examples include but are10
their frequencies are listed10
myriad of reports enabling10
is the readme file10
is the way to10
many things it contains10
the values of the10
the works in the10
to include links to10
subjects and the objects10
li ul p in10
group of technical services10
in the traditional manner10
notre dame journal of10
expensive in terms of10
phrases in a corpus10
and sort the result10
of words per item10
and a sparql endpoint10
all have more things10
rdf as linked data10
this particular corpus employs10
lists connoting an idea10
of times the query10
counting and tabulating the10
one of a number10
the sizes of its10
average number of pages10
each book in the10
together can be illustrated10
library federation annual meeting10
the readme file for10
can be put into10
how many records are10
communication is the key10
how many things it10
t get me wrong10
application of computer science10
the processes of librarianship10
categories uncategorized comment on10
item in the collection10
in the afternoon i10
are there one or10
methods in the humanities10
in the midst of10
the characteristics of the10
o a camp creative10
occur in specific items10
of each of the10
written by the same10
can see there are10
tiny list of part10
the move with the10
phrases in a text10
the conference was a10
given the opportunity to10
a p blockquote p10
these sorts of questions10
to do some of10
advocated the creation of10
tr tr th align10
in it he described10
making it easy to10
sizes of its items10
the exception of the10
a set of words10
data into plain text10
washington university in st10
of the collection as10
choosing occur in specific10
and how it can10
of pages per item10
idea of interest to10
lib open source software10
can prove to be10
and put the result10
from a given document10
vocabularies used to describe10
well as a few10
p p the second10
of items in words10
cite td td img10
went on to describe10
it has something to10
tabulating the words in10
use your text editor10
occur in the corpus10
of records in the10
in the catalog can10
the past couple of10
as opposed to a10
the result into a10
services against the texts10
are expected to know10
to be quite insightful10
given document in your10
in the united kingdom10
be illustrated through a10
of what and how10
illustrated by the following10
to the materials through10
words are used across10
of linked data publishing10
display location of word10
the united states is10
is very similar to10
li li strong most10
in a corpus and10
and the sizes of10
documents in the collection10
has been saved in10
once this is done10
used across a corpus10
the linked data of10
select one or more10
list of top tech10
creative width height border10
all i have to10
my goal is to10
only game in town10
who is mentioned in10
and their associated links10
code lib open source10
new and different ways10
or more words in10
and in the end10
the great idea tfidf10
advances in information retrieval10
com behas oai lod10
number of pages per10
based on the information10
content available on the10
the directory of open10
or not it is10
gp product ref as10
of congress subject headings10
be able to use10
of the number of10
tcp love html a10
for creating and maintaining10
product ref as li10
the process is not10
h links h p10
is a process for10
and tabulating the words10
click the start button10
the future of library10
been saved in the10
any number of ways10
item of the corpus10
chttp a f fdata10
more relevant than the10
of words or phrases10
correlation between pages and10
move with the mobile10
each item of the10
in each item of10
provide a means for10
i was there i10
posting describes how i10
as o a camp10
not the problem to10
width height class alignright10
of linked data is10
the only game in10
you will want to10
open source software award10
your choosing occur in10
would be a good10
if the answer is10
in each of the10
the distant reader will10
learning how to use10
url pointing to the10
humanities computing techniques to10
services against the result10
the beginning of the10
catalog can be illustrated10
it outputs sets of10
things into a single10
originally written for a10
records in the catalog10
into the search box10
describes some of the10
the result in a10
increasing availability of full10
zip file with a10
width height hspace vspace10
do everything you would10
of times each occurs10
i learned about the10
the location of the10
your milage will vary10
and it outputs sets10
it needs to be10
tools of the trade10
will become a part10
collected this water while10
associated with the given10
of documents in the10
the metaphysics of morals10
through the process i10
as well as in10
structured data for analysis10
a given document in10
on the world wide10
possible to count and10
items from a corpus10
there are so many10
the use of words10
without the use of10
to count the number10
is the key to10
possible to measure additional10
connoting an idea of10
d aselect where b10
height class alignright size10
edu emorgan files img10
on the information above10
employs three such dictionaries10
texas library association annual10
are akin to the10
you would do in10
a corpus of documents10
is possible to measure10
part of king henry10
such as the one10
the open archives initiative10
as well as computers10
for a number of10
a zip file with10
and the application of10
specific sets of words10
the catalog can be10
located in the text10
prove to be quite10
to the principles of10
to what degree does10
in order to learn10
much as it is10
it is better to10
texts in ways that10
word appears in a10
document in your corpus10
as well as all10
in a text and10
one of the original10
what is text mining10
the application of computer10
these lists connoting an10
is the heart of10
of the day was10
is possible to count10
computational methods in the10
a word cloud of10
words are used in10
display the proximity of10
in the spirit of10
d printing working group10
to the wider community10
center tr valign top10
particular corpus employs three10
advocated the use of10
of text mining are10
the creation of locally10
metadata provides an overview10
have a mindset of10
written for a presentation10
analysis of the corpus10
a word or phrase10
the tools of the10
can be illustrated through10
all of the files10
is possible to create10
not the end itself10
do these words occur10
valley group of technical10
brought to my attention10
is not the only10
words of your choosing10
to build on the10
a few months ago10
aselect where b fs10
number of topic words10
in a library catalog10
these words occur in10
and tabulate how specific10
the ideas behind the10
of a triple store10
the frequency of the10
how specific sets of10
a limited number of10
perusing the list of10
ohio valley group of10
home page for the10
almost always a correlation10
the whys and hows10
ead into rdf xml10
overview of what and10
are not really about10
is a need for10
sizes of items in10
we will have to10
a member of the10
the university of toronto10
how words of your10
in order to keep10
tcp baxter html a10
with the availability of10
characteristics of records in10
as a librarian i10
words occur in the10
more than a few10
by the same author10
png width height alt10
and dissemination of data10
an idea of interest10
file with a companion10
with an overview of10
as you would expect10
ref as li tf10
to provide services against10
we have a mindset10
there are at least10
of the library profession10
the name of the10
pages and number of10
between pages and number10
given the increasing availability10
these files help answer10
seem to be the10
but they are not10
allow the user to10
makes a lot of10
is the creation of10
use any number of10
there is almost always10
see how words of10
provides a means for10
will continue to be10
camp creative width height10
are of possible interest10
always a correlation between10
count word and phrase10
a myriad of reports10
page for more details10
are a part of10
the purposes of this10
com sandbox bibframe data10
sandbox bibframe data data10
wonder whether or not10
all the works of10
you may want to10
whys and hows of10
a correlation between pages10
and how many things10
td thesis td td10
the right of the10
collected this water on10
what is the average10
in regards to the10
the process of find10
they are expected to10
cite td tr table10
the first is to10
will not be able10
at the beginning of10
can find plenty of10
content as well as10
collection as well as10
believe it or not10
to do with librarianship10
my first epub file10
searched the library of10
with a presentation by10
tcp baxter xml a10
as much as it10
to measure additional characteristics10
edit the value of10
what and how many10
its primary purpose was10
done with a script10
described and demonstrated a10
tfidf score for each10
such as text mining10
total number of pages10
of natural language processing10
matrix of scatter plots10
some of the characteristics10
primary purpose of the10
of open access publishing10
it is very important10
and it is a10
it is not easy10
code br a href10
of structured data for10
are used across a10
tabulate how specific sets10
you to go to10
library of congress subject10
frequencies are listed below10
as long as the10
gave a presentation called10
you are going to10
amount of full text10
there are many ways10
the same time they10
corpus employs three such10
this essay was written10
on the move with10
for items of interest10
a greater amount of10
the length of the10
sets of loosely defined10
everything you would do10
more words in these10
is a href http10
com gp product ref10
possible correlations between numeric10
tcp baxter text a10
is a form of10
against the database to10
creation of locally defined10
surrounding the topic of10
enter a word of10
a camp creative width10
alignright a href http10
caption alignright a href10
between numeric characteristics of10
how this can be10
the primary purpose of10
degree do these words10
org resource walt disney10
it is now possible10
be able to read10
to get a list10
e d aselect where10
the reader will be10
of top tech trends10
and how these frequencies10
a leave of absence10
not a whole lot10
and through the use10
were a number of10
linked data is about10
in these lists connoting10
li li create a10
com codeforkjeff refine viaf10
the reader to select10
pl configure use constant10
the results of step10
the library profession has10
a triple store and10
right of the query10
count and tabulate how10
every item in the10
i learned a lot10
is mentioned in the10
to answer questions like10
the whole thing into10
queries can be applied10
the humanities and sciences10
transformed into plain text10
the result on a10
interface to the index10
indiana library federation annual10
access control in libraries10
just like any other10
to see how words10
digital versions of books10
be able to do10
the semantic web is10
in the humanities and10
correlations between numeric characteristics10
the distribution of words10
then to what degree10
words in these lists10
in order to find10
day of the conference10
of your choosing occur10
there one or more10
notes on word usage10
and number of words10
with the content they10
the levenshtein distance algorithm10
li code br a10
on the web is10
a huge number of10
great ideas coefficient is10
a role in the9
from all over the9
is not easy to9
end of the workshop9
based on personal experience9
edited version of eric9
files need to be9
see how easy it9
some of my take9
as a way of9
sort of leave of9
the a href https9
preservationists have the most9
i wrote a perl9
via linked data eric9
configure use constant root9
the great books survey9
this posting describes the9
my personal tei publishing9
in the recent past9
as of this writing9
this is a set9
it comes to love9
interactive map pie chart9
find all triples with9
interface allowing the reader9
with optical character recognition9
the current environment where9
to take better advantage9
new dog old tricks9
the result to a9
go to step on9
outlines my experiences there9
tr table h day9
i was happy to9
know about literary history9
a set of plain9
i was not able9
will probably happen because9
to investigate how to9
content of the hathitrust9
also be used as9
p p i then9
this is a tiny9
the profession does not9
in the current directory9
hspace vspace br a9
posting outlines my experiences9
be used to facilitate9
fun with elasticsearch and9
did my best to9
the last two weeks9
id formats technical report9
rss and the rss9
is more than possible9
number of open source9
the center of the9
linked data eric lease9
all over the world9
p p for the9
on the processes of9
accessible via linked data9
td newton td td9
are a few sample9
examples of how the9
to know how to9
p the distant reader9
page describes a corpus9
used to determine the9
to see how easy9
one of the things9
and the use of9
somewhere along the line9
because of sparql syntax9
they will want to9
term id formats technical9
bottom of the page9
fruits of my labors9
was given by strong9
for the creation of9
viaf identifiers for more9
order to be useful9
al li li b9
p the purpose of9
version of eric lease9
it ought to be9
there is the disclaimer9
the topic of mass9
the current state of9
p blockquote p where9
it is simply not9
a subset of liam9
to create and maintain9
org target blank http9
fun with rss and9
the content of books9
endpoint to a subset9
is simply not possible9
to evolve in order9
think of it as9
but not necessarily limited9
tarzan of the apes9
the forest from the9
for a set of9
compass by andrew sutton9
essay about my water9
leave of absence from9
the most challenging job9
output will be saved9
probably happen because of9
script called a href9
gallery valencia pages img9
of leave of absence9
the structure of the9
a number of open9
i learned a number9
the western world cite9
the beginnings of the9
exploiting the content of9
posting documents my experience9
files in the current9
library of america beta9
opportunities for future study9
provide a way to9
source software and libraries9
opportunity to visit the9
article was originally written9
museum and library services9
full text of all9
edu sandbox hathi downloadable9
script to rule them9
some of the challenges9
png width a br9
to create your own9
td tr tr th9
sparql endpoint to a9
formats technical report a9
sandbox liam tmp guidebook9
forest from the trees9
happens to be a9
been processed with optical9
shared with the audience9
using bibframe for bibliographic9
a document li li9
library school graduate students9
into a set of9
and this is my9
for the a href9
of absence from my9
for the purpose of9
of sparql syntax errors9
can be implemented through9
vspace br a href9
where the output will9
one way to skin9
fontologies fmods e d9
all triples with rdf9
specific types of nouns9
is not a panacea9
a br code li9
into a data structure9
in this release and9
log documents my experiences9
this is a simple9
topic modeling is an9
of the types of9
this page describes a9
align center a href9
provide the means to9
a few of my9
come up with a9
number of ways the9
creation of your own9
this travel log outlines9
php fm article view9
in order to remain9
not as necessary as9
be able to understand9
then what might that9
once in a while9
goal of the project9
with rss and the9
notre dame digital humanities9
and the rss aggregator9
not necessarily limited to9
the core principles of9
subset of liam linked9
happen because of sparql9
what open source software9
python natural language toolkit9
how the process of9
and demonstrates how the9
new york technical services9
way to skin a9
a special issue of9
more information about the9
identifiers for more than9
on the use of9
of liam linked data9
mining and the digital9
combined with primo central9
this was a presentation9
of my alex catalogue9
institute of museum and9
but there is the9
this posting outlines how9
software and open access9
in the last century9
was written for the9
was given at the9
presentation was given to9
just about any type9
p p in a9
a few of us9
michael hart in roanoke9
p ul li what9
et al li li9
one way to accomplish9
describes a corpus named9
the tennessee library association9
been able to do9
a number of rudimentary9
td align center a9
a new dog old9
it is easier to9
this posting outlines my9
states agriculture information network9
provide better library service9
tr table p i9
of the conference was9
edu emorgan files model9
coercing the corpus into9
bibframe for bibliographic description9
the corpus as a9
to make it easier9
of top technology trends9
on the content of9
with elasticsearch and marc9
very much like the9
is more important than9
text of all the9
td car td td9
triples with rdf schema9
this article describes the9
at the national library9
a few sample queries9
the existence of a9
of years ago i9
and links to the9
source software and open9
a travel log http9
its primary purpose is9
to skin a cat9
about my water collection9
the skills of many9
outlines my experiences at9
characteristics of a text9
reader is a tool9
the state of the9
order to remain relevant9
teaching a new dog9
topic of mass digitization9
tcp love xml a9
of museum and library9
given a text and9
i wish i could9
written by pete johnston9
as well as others9
and english literature as9
the same time i9
it is imperative to9
this release and have9
linked data is the9
a sort of leave9
and saving the result9
to a subset of9
an essay about my9
there is more than9
please see the download9
a person to do9