Ted Lawless

Ted Lawless
	Connecting Python's RDFLib to AWS Neptune
I've written previously about using Python's RDFLib to connect to various triple stores. For a current project, I'm using Amazon Neptune as a triple store and the RDFLib SPARQLStore implemenation did not work out of the box. I thought I would share my solution.
The problem
Neptune returns ntriples by …
	Usable sample researcher profile data
I've published a small set of web harvesting scripts to fetch information about researchers and their activities from the NIH Intramural Research Program website.
On various projects I've been involved with, it has been difficult to acquire usable sample, or test data, about researchers and their activities. You either need …
	Exploring 10 years of the New Yorker Fiction Podcast with Wikidata
Note: The data behind this datasette was updated on 7/24/18 to capture new episodes. The examples below might result in slightly different numbers.

The New Yorker Fiction Podcast recently celebrated its ten year anniversary. For those of you not familiar, this is a monthly podcast hosted by New …
	Now Publishing Complete Lahman Baseball Database with Datasette
Summary: The Datasette API available at https://baseballdb.lawlesst.net now contains the full Lahman Baseball Database.

In a previous post, I described how I'm using Datasette to publish a subset of the Lahman Baseball Database. At that time, I only published three of the 27 tables available in the …
	Publishing the Lahman Baseball Database with Datasette
Summary: publishing the Lahman Baseball Database with Datasette. API available at https://baseballdb.lawlesst.net.

For those of us interested in open data, an exciting new tool was released this month. It's by Simon Willison and called Datasette. Datasette allows you to very quickly convert CSV files to a sqlite …
	SPARQL to Pandas Dataframes
Update: See this Python module for converting SPARQL query results into Pandas dataframes.

Using Pandas to explore data SPARQL
Pandas is a Python based power tool for munging and analyzing data. While working with data from SPARQL endpoints, you may prefer to explore and analyze it with pandas given its …
	Querying Wikidata to Identify Globally Famous Baseball Players
Earlier this year I had the pleasure of attending a lecture by Cesar Hidalgo of MIT's Media Lab. One of the projects Hidalgo discussed was Pantheon. Pantheon is a website and dataset that ranks "globally famous individuals" based on a metric the team created called the Historical Popularity Index (HPI …
	Python ETL and JSON-LD
I've written an extension to petl, a Python ETL library, that applies JSON-LD contexts to data tables for transformation into RDF.
The problem
Converting existing data to RDF, such as for VIVO, often involves taking tabular data exported from a system of record, transforming or augmenting it in some way …
	OrgRef data as RDF
Notes on mapping OrgRef to DBPedia and publishing with Linked Data Fragments .
	Using Pelican for a Gitub personal page
Notes on converting this Github user page based site to Pelican, a Python based static site generator.
	Select2 for RDF editing interfaces
A description and example of using Select2 to build an editing interface for RDF data.  Sample HTML and JavaScript is included that relates skos:Concepts from the assignFAST webservice provided by OCLC to local researcher URIs.
	Connecting Python's RDFLib and Stardog
For a couple of years I have been working with the Python RDFLib library for converting data from various formats to RDF.  This library serves this work well but it's sometimes difficult to track down a straightforward, working example of performing a particular operation or task in RDFLib.  I have …
	Python and JSON-LD
I've published some code for mapping CSV data to RDF using Python and JSON-LD on Github.  The motivation for this work was:

to provide sample data to help people get started with VIVO, the research profile system built on Semantic Web standards
to learn more about JSON-LD and explore it …
	An OpenRefine reconciliation service for academic journal data
Recently I've been working to link local data stored in VIVO as RDF with other sources on the Web.  The RDF Refine extension for OpenRefine1 has been a useful tool in this work.  OpenRefine allows you to query a reconciliation service to match local strings to entities from another …
	Solr Document Signatures
I previously wrote about working with Apache mod_cache, HTTP ETags, and VIVO to cache public pages.  After writing that post, I found that Solr supports adding "signatures" to documents as a way to identify if a document is identical to another.  This feature was added to Solr as a way …
	Caching VIVO profiles with ETags and mod_cache
Update - Caching VIVO pages with ETags was made part of the VIVO/Vitro core code in
release 1.6. This solution is no longer necessary and the methods described here have been made part of the software. See the project documentation for information on how to set this up. Any …
	A utility script for developing VIVO custom list views
As you continue with your VIVO implementation, you might want to adjust the properties that are displayed on profile pages.  VIVO handles what properties display on a profile page through a set of list views, which are SPARQL queries that pull data from your VIVO store and pass it on …
	Using Z39.50 to produce a Stack View.
The Harvard Library Innovation Lab has developed a library browsing tool called Stack View.  It provides a way to virtual browse through a collection of items from a library.
The examples on the Stack View website show how to pull data in from a variety of sources and example scripts …
	Reading and writing RDF for VIVO with RDFAlchemy
Note: This post is most likely out of date. Code samples are unlikely to still work. Please use only as background/reference.

For the last few months I have been working on converting a diverse set of data about the university and its faculty into RDF for import into VIVO …
	A Python module for placing requests in ILLiad
This post describes a Python module for creating requests in ILLiad, the interlibrary loan software used in libraries.  

Many libraries use ILLiad as the software system for document delivery and interlibrary loan services.  As a developer working with this system, you might find a need to create ILLiad requests programmatically …