id author title date pages extension mime words sentences flesch summary cache txt literarymachin-es-7470 literary machines .xml application/atom+xml 617 56 72 Also, some very big web archiving initiatives have moved to pywb in recent years: Webrecorder itself, Rhizome, Perma, Arquivo PT in Portugal, the Italian National Library in Florence (Italy), and others I'm missing. Web archiving activities, like any other activity where an HTTP client is involved, leave traces: the web server you are visiting or crawling will save your IP address in its logs (or, even worse, it can decide to ban your IP). Can we also archive the web through Tor? Versions higher than 7.40 can generate static tiles of big images in Deep Zoom format, saving them directly into a zip archive. For a long time the only free implementation of web archive replay software (I'm unaware of commercial ones) has been the Wayback Machine (now OpenWayback). But there is a new player in the game, pywb, developed by Ilya Kreymer, a former Internet Archive developer. ./cache/literarymachin-es-7470.xml ./txt/literarymachin-es-7470.txt