id author title date pages extension mime words sentences flesch summary cache txt work_ofdqqvswinby7gitk5jlbq3674 Michael J. Cafarella Structured data on the web 2011.0 41 .pdf application/pdf 788 203 75 Structured Data and the Web Structured Data and the Web • A huge amount of structured data on the Web – Government data, crime, water condiLons, … Goal: Structured Data Ecosystem (points, polygons) from a large data set ü Google Fusion Tables: ü Google Fusion Tables: Tables on the Web Goal: Search for Structured Data • Finding the good tables on the Web • Understanding user's intenLons See "Google's Deep Web Crawl", VLDB 2008 – Single-table databases; Schema = attr labels + types – Recovers good relations from crawl and enables search Searching Tables is Tricky – Hits on table body results for tables Modeling Challenge: Data is About Everything – AcLon movies Recovering Table Semantics Raw HTML Tables Recovered Relations Relation Search Job-title, company, date 104 • Fusion Tables: helping get the ecosystem started. • Search for structured data sets: – Create new data sets • Deep web: VLDB 2008 ./cache/work_ofdqqvswinby7gitk5jlbq3674.pdf ./txt/work_ofdqqvswinby7gitk5jlbq3674.txt