Distant Reader Gateway

How to use the Distant Reader


There are more than a few steps to getting the Distant Reader to work for you:

  1. Articulate a research question: What do you want to know?
  2. Identify one of three different types of input pointing to the content you want to "read": a URL, a file of URLs, or just about any set of files.
  3. If you haven't already done so, create an account.
  4. Log in.
  5. Submit your input.
  6. Wait.
  7. Download the resulting Distant Reader "study carrel".
  8. Use the "study carrel" to supplement your usual reading process.

There is a lot of information there, and believe me, the process is faster the second time around.


Articulate a research question

Ask yourself, "What do I want to know?" The answer to this question can range from the simple to the sublime, for example:

  1. How many documents are in my corpus?
  2. How long are my documents?
  3. On a scale from 0 to 100, how difficult are my documents to read?
  4. How can I summarize each document in my corpus?
  5. What are some of the most statistically significant keywords in my corpus?
  6. What email addresses appear in my documents, and how often?
  7. What URLs appear in my documents, and how often?
  8. What is discussed in my corpus?
  9. Who are the people in my corpus?
  10. What are the actions that take place in the corpus?
  11. How are things in my corpus described?
  12. What places, organizations, dates & times, and numbers & money amounts are mentioned in my corpus?
  13. How can I describe my corpus in a single word? How can I describe my corpus in three words? If I were to divide my corpus into five themes, and each theme were enumerated with five words, then what might those themes & words be?
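
To make questions like these concrete, here is a minimal sketch of how email addresses and URLs might be counted & tabulated across a handful of documents. It is not how the Distant Reader itself does the work; the regular expressions are deliberately simple, and the sample documents, the addresses, and the function name tabulate are all made up for illustration.

```python
import re
from collections import Counter

# Deliberately simple patterns; illustrative, not exhaustive.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
URL_PATTERN = re.compile(r"https?://[^\s<>]+")

def tabulate(pattern, texts, lowercase=False):
    """Count every match of pattern across an iterable of document strings."""
    counts = Counter()
    for text in texts:
        for match in pattern.findall(text):
            # Drop punctuation that trails a match at the end of a sentence.
            match = match.rstrip(".,;)")
            counts[match.lower() if lowercase else match] += 1
    return counts

# Hypothetical sample documents standing in for a real corpus.
documents = [
    "Write to alice@example.org or see https://example.org/about for details.",
    "Alice@example.org replied; the page https://example.org/about was updated.",
]

# Folding email addresses to lower case is a simplification, not a rule.
email_counts = tabulate(EMAIL_PATTERN, documents, lowercase=True)
url_counts = tabulate(URL_PATTERN, documents)

for address, count in email_counts.most_common():
    print(count, address)
for url, count in url_counts.most_common():
    print(count, url)
```

The same pattern-and-counter approach extends to other simple questions, such as counting documents or measuring their lengths.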

Identify your input

The input can be any one of the following three things: a URL, a file of URLs, or just about any set of files on your computer.
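
If you choose the second type of input, a file of URLs, something like the following sketch can help you prepare it. It assumes the file is plain text with one URL per line, which is a common convention but an assumption here. The file name urls.txt, the example.org addresses, and both function names are made up for illustration.

```python
from pathlib import Path
from urllib.parse import urlparse

def is_fetchable(url):
    """Return True when url has an http(s) scheme and a host name."""
    parts = urlparse(url)
    return parts.scheme in ("http", "https") and bool(parts.netloc)

def write_url_file(urls, path):
    """Write the valid, de-duplicated URLs to path, one per line; return them."""
    kept = []
    for url in (u.strip() for u in urls):
        if is_fetchable(url) and url not in kept:
            kept.append(url)
    Path(path).write_text("\n".join(kept) + "\n", encoding="utf-8")
    return kept

# Hypothetical candidate list; replace with the URLs you want "read".
candidates = [
    "https://example.org/essay-one.html",
    "https://example.org/essay-two.pdf",
    "https://example.org/essay-one.html",  # duplicate, dropped
    "not a url",                           # no scheme or host, rejected
]

kept = write_url_file(candidates, "urls.txt")
print(len(kept), "URLs written to urls.txt")
```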

Create an account

If you haven't already done so, create an account.

The Distant Reader process creates its "study carrels" for you, and consequently the Reader needs to know for whom it is doing its processing. Your employer may already be affiliated with the Reader's authentication process, and if so, you will not need to create an account. Otherwise, you will need to tell us a bit about yourself. The process is harmless. We promise.

Log in

Authenticate yourself; let the Distant Reader know for whom it is doing its good work.

Submit your input

Share your input with the Distant Reader so it can do its good work.

Wait

Be patient. The Distant Reader is designed to handle hundreds of URLs pointing to hundreds of megabytes of files. Depending on the size of your input, processing can take anywhere between two minutes and a few hours, with a "usual" wait time of ten to fifteen minutes.

Download your "study carrel"

The result of the Distant Reader process is a "study carrel" -- a .zip file containing your original content, various transformations of it, and various sets of analyses.
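
Because a "study carrel" is an ordinary .zip file, you can peek inside it before unpacking it. The sketch below counts the files in an archive by top-level folder. So that it runs on its own, it first builds a tiny stand-in archive; the file name study-carrel.zip and the folder names inside it are invented for the demonstration and say nothing about the layout of a real carrel.

```python
import os
import tempfile
import zipfile
from collections import Counter
from pathlib import PurePosixPath

def summarize_archive(zip_path):
    """Count the files in a .zip archive, grouped by top-level folder."""
    counts = Counter()
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            # Files at the root of the archive are grouped under ".".
            top = parts[0] if len(parts) > 1 else "."
            counts[top] += 1
    return counts

# Build a small stand-in archive; every name below is invented.
with tempfile.TemporaryDirectory() as tmp:
    demo_path = os.path.join(tmp, "study-carrel.zip")
    with zipfile.ZipFile(demo_path, "w") as archive:
        archive.writestr("original/a.txt", "first document")
        archive.writestr("original/b.txt", "second document")
        archive.writestr("derived/a.txt", "a transformation")
        archive.writestr("notes.txt", "a file at the root")
    summary = summarize_archive(demo_path)

for folder, count in sorted(summary.items()):
    print(folder, count)
```

To inspect a carrel you have downloaded, call summarize_archive with the path to your own .zip file.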

Read

Use the contents of the "study carrel" to supplement your usual reading process.

Eric Lease Morgan <emorgan@nd.edu>
May 13, 2019